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ABSTRACT 

Arguably the best hope of understanding the tail end of the reionization of the inter- 
galactic medium (IGM) at redshift z > 6 is through the detection and characterization 
of the Gunn-Peterson (GP) damping wing absorption of the IGM in bright quasar spec- 
tra. However, the use of quasar spectra to measure the IGM damping wing requires a 
model of the quasar's intrinsic Lyman-a emission line. Here we quantify the uncertain- 
ties in the intrinsic line shapes, and how those uncertainties affect the determination of 
the IGM neutral fraction. We have assembled a catalog of high-resolution HST spec- 
tra of the emission lines of unobscured low-redshift quasars, and have characterized 
the variance in the shapes of their lines. We then add simulated absorption from the 
high-redshift IGM to these quasar spectra in order to determine the corresponding 
uncertainties in reionization constraints using current and future samples of z > 6 
quasar spectra. We find that, if the redshift of the Lyman-a emission line is presumed 
to coincide with the systemic redshift determined from metal lines, the inferred IGM 
neutral fraction is systematically biased to low values due to a systematic blueshift of 
the Lyman-a line relative to the metal lines. If a similar blueshift persists in quasars at 
z > 6, this bias strengthens previous claims of a significant neutral hydrogen fraction 
at z « 6. The bias can be reduced by including a Lyman-a blueshift in the mod- 
eling procedure, or by excising wavelengths near the Lyman-a line center from the 
modeling. Intrinsic Lyman-a line shape variations still induce significant scatter in 
the inferred xiqm values. Nevertheless, this scatter still allows a robust distinction be- 
tween a highly ionized (xigm ~ 10~^) and a neutral (xiqm — 1) IGM with even a few 
bright quasars. We conclude that if the variations of the intrinsic Lyman-a emission 
line shapes in high-z quasars are similar to those at low-z, this variation will not limit 
the usefulness of quasar spectra in probing reionization. 

Key words: quasars:general - cosmology: theory - observation - ultraviolet: general 
- quasars: absorption lines - quasars: emission lines 



1 INTRODUCTION 

One of the most important frontiers of observational astron- 
omy is the study of the reionization of the universe. The de- 
tails of reionization — when it started, how long it took, and 
what types of objects contributed ionizing photons — hold 
clues to the birth and early history of the various structures 
we see in the universe today. Because of their high luminosi- 
ties and relatively flat spectra, quasars are excellent probes 
of reionization. Many efforts have been made to constrain 
the ionization state of the intergalactic medium (IGM), and 
thus the timing of the tail-end of reionization, using spectra 
of z > 6 quasars discovered by th e Sloan Digital Sky Sur- 
vey (|Fan. Carilli fc Keating! |2006| . and references therein) . 
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Beginning at lower redshift, the forest of Lyman-a absorp- 
tion lines in a quasar spectrum traces the fluctuating neu- 
tral hydrogen density along the line of sight. At z ~ 6, the 
Lyman- £x forest lines blend toget her and saturate, formin g 
the dark Gunn-Peterson trough l|Gunn fc Petersoi] 
indicating a neutral fraction of > 10~^ (|Fan et al.l 
This saturation at such a small neutral fraction limits the 
utility of Lyman-series absorption spectra as direct probes 
of the reionization epoch, though the scattered transmission 
windows in this otherwise dark trough do contain some in - 
formation about the IGM ionization state (|Fan et al.ll2006l V 
Fortunately, absorption is not confined to photons in the 
narrow, resonant core of the line. Photons passing through 
the neutral hydrogen in the IGM can be absorbed in the 
damping wings of the Lyman-a line, far from the cen- 
tral resonance wavelength. This means that the hydrogen 
that creates the Gunn-Peterson (GP) trough also has a 
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damping wing extendi nR redward of the edge of the trough 
l|Miralda-Escudel 1 1998h . If the neutral hydrogen extends all 
the way up to the redshift of the source, then the GP trough 
will extend up to the wavelength of the source's Lyman- a 
emission line. A quasar, however, is an extremely luminous 
source of ionizing radiation. This radiation ionizes a bubble 
of the surrounding IGM. The bubble is large enough (tens of 
comoving Mpc) to extend over a significant redshift interval, 
which pushes the edge of the GP trough to lower redshift, 
opening up a transmission window in the quasar spectrum 
blueward of its central Lyman- a emission wavelength. The 
damping wing of the GP trough extends into this window, 
where its characteristic absorption profile can be detected 
and its magnitude used to constrain the IGM neutral frac- 
tion. Since the damping wing is many orders of magnitude 
weaker than absorption in the core of the line, this technique 
can probe much higher n eutral hydrogen fractions wi thout 
sat uration (ICen fc Haiman 200(?: lMadau fc Rocs 2000l'). 

iMesinger fc Haiman (2007, hereafter ,MH07, ) searched 
for the signature of the damping wing in spectra of three 
z > 6.2 quasars, finding best-fit values for the IGM neutral 
fraction of xigm ~ 1.0, 1.0, and 0.2, at z = 6.22, 6.28, and 
6.4 respectivelyQ The uncertainty in this inferred value is 
large, but they determined a lower limit of xigm > 0.04 for 
the first two sources. In an earlier analysis of the spectrum of 
the z = 6.28 quasar, iMesinger fc HaimanI (l2004f l determined 
2;iGM 0.2 using both the Lyman- a and Lyman- p regions 
of the spectrum. Other attempts to use the transmission 
window to measure iigm have focuse d on the size of the ion - 
ized region and found similar results (|Wvithe fc Loebll2004h . 
Fluctuations in the IGM density and ionizing background 
(as well as uncertainties in assumptions about the quasar age 
and luminosity) rep resent significant sources of uncertainty 
in th ese estimates (|Bolton fc Haehneltl l2007l : iMaselli et al.l 
I2OO7I ). 

There are two main sources of uncertainty in determin- 
ing the IGM ionization state using the GP damping wing. 
The first is the uncertainty in the underlying emission spec- 
trum of the quasar. The damping wing absorption profile 
is seen against the intrinsic spectrum of the quasar, which 
must be modeled accurately in order to accurately calculate 
the optical depth profile. The second source of uncertainty 
is in modeling the IGM absorption profile itself. Both the 
damping wing and the superimposed absorption from resid- 
ual neutral hydrogen inside the ionized region are affected by 
the density structure and ionizing background in the IGM. 
These effects must be incorporated into the model that is fit 
to the observed spectra. 

In this paper we address only the uncertainty caused 
by errors in the determination of the intrinsic quasar spec- 
trum. Since the edge of the Gunn-Peterson trough is at only 
a slightly lower redshift than the quasar itself, the absorption 
profile we want to measure is superimposed on the blue wing 
of the quasar's Lyman-a emission line (nominally centered 
at 1215.67 A in the rest frame). Therefore, determination of 



^ Throughout this paper we use xiqm to mean the neutral frac- 
tion at mean IGM density. This can differ from both the volume- 
weighted neutral fraction and the mass- weighted neutral fraction, 
depending on the clumping factor and density distribution of the 
IGM. 



the intrinsic fiux requires that we model the quasar's emis- 
sion line profile accurately, and be able to fit the model to 
the observed spectrum using only the unabsorbed red mng 
of the line profile. Random errors in this process will add 
to the scatter in recovered values. More worryingly, a bias 
in the fiux modeling (a consistent asymmetry in the profile 
that was not included in the model, for instance), would btas 
the a;iGM results. 

The purpose of the present paper is to quantify both 
the bias and the scatter in the inferred IGM neutral frac- 
tion (and other model parameters) that arise from variations 
of the intrinsic Lyman-a emission line shape. More specif- 
ically, we see k to an swer the following questions: (i) how 
accurately do lMH07l extrapolate the fiux from the red side 
of the Lyman-a line to the blue side, (ii) how do errors in 
the extrapolation affect the IGM neutral fraction determi- 
nation, and (iii) could fiux errors bias the xigm value and 
cause an ionized IGM to mimic a neutral one, invalidating 
their results? 

The first step in answering these questions is to under- 
stand the intrinsic shapes of quasar Lyman-a emission lines. 
High-redshift quasars obviously have too much absorption in 
the blue wing of the line to be useful for this purpose. We 
found that even at z = 2.1, where the Lyman-a line enters 
the Sloan Digital Sky Survery (SDSS) spectral wavelength 
range, the Lyman-a forest is often thick enough to inter- 
fere with a precision study of the line profile. Therefore we 
chose to study a library of spectra observed in the ultra- 
violet (UV) by the Hubble Space Telescope, primarily at 
z < 1. This catalog of spectra, and the fits we perform in 
!j3l may also be useful in the study of quasar environments 
and the quasar "proximity effect" at 2 < 6. We discuss this 
possibility further in f|7] 

The rest of this paper is structured as follows. In !j2l 
we discuss the library of low-2 spectra that we assembled. 
The results of the line profile fits are discussed in SjS] In ^ 
we analyze our ability to predict the fiux on the blue side 
of the line using only the fiux on the red side accurately at 
high-redshift. We describe our simulations of high-redshift 
absorption spectra in !j5l and the impact of extrapolation 
errors on the IGM neutral fraction recovery in ^ In i)6.4l 
we offer our conclusions. Finally, in SjT] we discuss the future 
potential of the techniques outlined here. 



2 STUDYING LOW-Z QUASAR LYMAN-a 
EMISSION LINE PROFILES 

2.1 Assembling a library of quasar spectra 

Our sample of low-redshift, unobscured quasars was selected 
from the available HST archival data in a multi-stage pro- 
cess. Our criteria for selection included the instrument used, 
spectral resolution, wavelength coverage, and signal-to-noise 
ratio. Each entry in the HST data archive (part of the Mul- 
timission Archive at the Space Telescope Science Institute, 
MASlj3) is tagged with "a short description of the target, 
supplied by the observer"0. We search for all Faint Object 



^ http:/ /archive. stsci.edu/hst/search.php 

^ http:/ /archive. stsci.edu/hst/help/scarch_help. html 
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Spectrograph (FOS) and Goddard High-Resolution Spectro- 
graph (GHRS), observations of objects with descriptions in- 
cluding "QSO" or "QUASAR". This yielded 2742 datasets. 
We found 856 unique RA and Dec coordinate pairs in this 
list, making no attempt to eliminate close matches at this 
stage. 

In order to identify our objects and find their redshifts, 
we searched the NASA Extragalactic Database (NED) for 
objects of type "QSO" near each set of coordinates, tak- 
ing the redshift of the closest match (93% were found). We 
next eliminated all observations that did not cover the ob- 
jects' redshifted Lyman- oc wavelength (calculated from the 
NED redshift), which excluded about 44% of the objects. 
We found additional spectral datasets for these same ob- 
jects by searching MAST for observations at the coordinates 
of the already-identified objects, then selecting additional 
spectra containing the Lyman-a line. We eliminated all ob- 
servations that were not taken with a grating (i.e. prism or 
mirror observations). At this point we ran our code to align 
and coadd the selected spectra, as described in Appendix 
\K\ We then eliminated spectra that did not include a desig- 
nated wavelength range surrounding the line (1200-1286 A 
rest frame, cutting the object list by another 15%), and ex- 
cluded spectra with a mean signal-to-noise ratio of less than 
10 (eliminating 42% of the remaining objects). This left 120 
spectra of 112 objects (only one GHRS spectrum, the rest 
FOS). After running our line- profile fits, we eliminated 16 
more spectra (14 objects) for which our line-profile fitting 
code failed to converge (usually in the emission/absorption 
feature detection stage; see Appendix [Bj. After visual in- 
spection of the full-profile fits we cut 21 more spectra (20 
objects), 8 for poor data quality (obvious bias in the flux 
over part of the wavelength range, for instance), 2 because 
the spectra lacked visible emission features, and 11 because 
of excessive absorption, either in the form of Lyman-cc for- 
est lines or broad absorption lines. This left 86 spectra (all 
FOS) of 78 objects as our final sample. Table [1] lists all 120 
automatically-selected spectra, with notes on the 21 spec- 
tra that were cut, as well as on other unusual spectra or 
fits. Figure [T] shows the redshift distributions of all of the 
quasars found in the HST archives, and of those selected for 
this project. We have not, so far, used any Space Telescope 
Imaging Spectrograph (STIS) data, but this would increase 
our sample considerably (perhaps by a factor of two). 

2.2 Defining a spectral model 

In order to characterize the uncertainties inherent in extrap- 
olating a high-redshift quasar's spectrum from the red side 
of the Lyman-a line to the blue side, we performed two kinds 
of fits on our low-redshift spectra: one using the full profile 
visible in the low-2 spectra, and one using only the data 
from the red side of the line (mimicking a fit to a high-z 
spectrum). 

We follow [MH07l and adopt the commonly-used spectral 
model consisting of a power-law continuum, a Gaussian N v 
line, and a double- Gaussian Lyman-a line. Although the N v 
line is actually a doublet (1238.821 and 1242.804 Afl, we fit 
it with a single Gaussian since the lines are quite broad 

* http:/ /physics. nist.gov/PhysRefData/ASD/lines_form. html 
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Figure 1. Redshift distribution of the sample. The upper (grey) 
line shows all of the NED redshifts we found for quasars in the 
MAST Hubble archive. The lower (blue) line is for the sample of 
87 spectra we adopted for this project (see il2.1|l . 

and thoroughly blended. On the other hand, the Lyman-a 
line usually has prominent broad and narrow components 
in quasar spectra (thought to arise from gas close to, and 
relatively farther from the central black hole, respectively), 
so we fit it with two independent Gaussians. Note that the 
components found in our fits do not necessarily correspond 
to physically distinct emission regions, but this combination 
does yield good phenomenological fits to the line profiles, 
which is our primary concern in this paper. 

Our spectral model, illustrated by the best-fit to a typ- 
ical spectrum (the quasar 3C 273) is shown in Figure (2] 
The line centers of the three Gaussians, especially that of 
the narrow Lyman-a component, are shifted significantly 
from their nominal wavelengths (as discussed in il3.2|) . The 
model has three parameters for each Gaussian (width, cen- 
tral wavelength, amplitude) and two for the power-law (in- 
dex and normalization), for a total of 11 free parameters. 
In practice, the power-law is fit first, then a separate fit 
is performed with the power-law parameters fixed and the 
emission line parameters free. In some flts the power-law 
index or line centers are also fixed (see !j3] and [4| . 

2.3 Procedure for line-profile fitting 

Each fit is performed using Levenberg-Marquardt mini- 
mizationl3 We perform our fits separately for the continuum 
and the combined emission lines. First we fit the power-law 
alone to regions of the spectrum chosen to be free of emission 
features. The power-law fit regions are, in the rest frame: 
[1155,1165], [1280,1293], [1315,1325], and [1340,1360] A. 
Some spectra do not include this full wavelength range. In 
any fit to a spectrum that lacks data at Ao < 1165 A we fixed 
the power law index to —1.3, since we have found that ex- 
trapolating the continuum from only the longer-wavelength 
regions can be unreliable. 

Once the index and normalization of the power-law are 

^ Fits use the PDL : : FIT : : Levmar package 

(jhttp;/ /search. cpan.org/~jlapcyrc/PDL-Fit-Levmar/ll based on 
levmar i jhttp: //www. ics. forth. gr/~| lourakis /levmar / ) . 
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Table 1. HST (FOS and GHRS) quasar Lyman- oc emission line spectra. 



Name 


z 


Instrument 


Aperture 


Grating 


S/N 


Comments on spectra 


(NED) 


(NED) 








(mean) 


and fits. 


3C 273 


0.158 


FOS 


B-2 


H13 


19.9 




[HB89] 1427+480 


0.221 


FOS 


B-3 


H13 


12.8 


Absorption feature in blue wing of 














the line (flagged in full fit). 


PG 0953+414 


0.234 


FOS 


C-2 


H13 


11.1 


Absorption feature at line center. 


[HB89] 1156+213 


0.349 


FOS 


B-3 


H19 


10.7 




PG 1049-005 


0.360 


FOS 


C-2 


H19 


13.6 


Excluded due to bad blue edge of 



spectrum. 



The name and redshift z are from NED. This is a sample of the full tabic, which is available in the electronic version. 
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Figure 2. A model for a quasar spectrum (3C 273) in the vicinity 
of the Lyman- a emission line. Components (as given in the leg- 
end) are: broad and narrow Lyman-a Gaussians, a N V Gaussian, 
and a power-law continuum. The centers of the Gaussian com- 
ponents (which are shifted significantly relative to their nominal 
wavelengths based on the NED redshift) are indicated with short 
vertical lines. Complete vertical lines indicate the nominal central 
wavelengths of Lyman- a (solid) and N V (dotted) . 



determined and fixed, the emission line components are fit 
to a region centered on the Lyman-a line at 1215.67 A, plus 
two neighboring continuum-dominated regions. The contin- 
uum regions are included to help constrain the widths of 
extremely broad Gaussian components. The Gaussian fit re- 
gions are, in the rest frame: [1155,1165], [1185,1250], and 
[1280, 1293] A. 

Initial values for the model parameters are given below. 
These were chosen by trial and error to speed convergence, 
and are not necessarily representative of the best-fit values. 
The powerlaw index and normalization start at 



(1) 



where Jpl is the median fiux in the power-law fit region and 
the continuum flux is given by 



Fx 



Q.PL 



(2) 



with Aq = 1215.67(1 + z) A. The Lyman-a narrow- 
component width, center, and amplitude start at 

CTLaO =4 (1 + z) A; /iLaO = 1215.67 A; OLaO = fb- (3) 



The Lyman-a broad-component width, center, and ampli- 
tude start at 

CTLal =8 (1 + z) A; /iLal = 1215.67 A; flLal = Fq. (4) 

The N V width, center, and amplitude start at 

(TNv = 2 (1 + 2) A; ^iNv = 1240.81 A; snv = 2Fg, (5) 

where z is the redshift of the quasar (from NED), Fq is 
the median fiux in the Gaussian fit region, and the flux for 
Gaussian component i is given by 



Fx 



ai exp 



2a? 



(6) 



While we chose to use low-redshift quasars in order to 
study relatively unobscured spectra, some absorption fea- 
tures do appear from both the Galactic interstellar medium 
(ISM) and the IGM. In order to avoid ISM absorption lines, 
we exclude a small region around all of the lines listed 
by IVerner. Barthel fc Tvtletj (|l994l '). The half- width of the 
excluded region is given by w = \/ w'iniu + "^lust > where 
w'min = 1-5 A and winat = 2 FWHM/2.355, with FWHM 
being the full width at half maximum of the instrumental 
line spread function (LSF). 

We use an iterative scheme described in Appendix [B] 
to exclude other absorption features. In order to avoid bi- 
asing the results, we apply essentially the same exclusion 
criteria to positive and negative deviations, meaning we ex- 
clude both excess absorption and emission. In practice very 
few pixels are flagged as emission features (except for some 
spectra in which broad emission lines intrude into the power- 
law fit regions), while most absorption features obvious to 
the eye (or in the ISM list) are fiagged, with few apparently 
spurious detections. Once this feature detection scheme con- 
verges, we perform a final fit on the masked spectrum to 
determine the best-fit parameters. 



3 THE SHAPES OF LOW-Z QUASAR LINE 
PROFILES 

3.1 Quality of the line-profile fits 

Before proceeding to our main goal of studying the impact 
of emission-line shape variations of fits to high-z quasars, in 
this section we describe how well the full profiles can be fit 
with our adopted spectral model. An example of a typical 
fit to a quasar spectrum is shown in Figure |3] (top plot). The 
upper panel shows the spectrum, while the lower panel shows 
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Figure 3. Spectral model fit to the observed spectrum of 3C 273 
using the full line profile (top plot) and using only data redward of 
Lyman- oc (bottom plot). The points in the upper panels show the 
observed fiux (erg s~^ cm~^ A.~^), with error envelope in grey. 
The best- fit model is the black line (often hidden by the data). 
Vertical solid (green) lines indicate expected broad emission lines 
(at the quasar redshift), while vertical dashed (purple) lines indi- 
cate expected narrow ISM absorption lines (at 2 = 0). Red points 
were used for the power-law continuum fit. Blue points were used 
for the line-profile fit. Green points were excluded because of the 
presence of absorption lines. Grey points were not included in the 
fit. The top axis is labeled in the quasar rest frame, the bottom in 
the observer frame. The bottom panels show the residuals divided 
by the quoted formal measurement error. At the bottom-right are 
sideways histograms of the residuals (points) with the expected 
Normal distributions shown by the solid lines. The text in the 
upper panels gives the redshift z; the widths a of the narrow 
Ly-a, broad Ly-oc, and N v components; the height of each com- 
ponent relative to the continuum a/ap; the velocity shifts rishift of 
each component; the powerlaw index p; the continuum amplitude 
at Ly-oc line center Op (erg s~^ cm~'^ A~^); and the reduced 
values x^/^ ^^v the continuum and line-profile fits, respectively. 
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Table 2. Quasar Emission Lines Near Lyman-a 



Line 


Rest-Frame Wavelength 


Cm* 


1175.70 


Lyman- a. 


1215.67 


Nv 


1238.82 


Nv 


1242.80 


Sill 


1262.59 


OH-Siii 


1305.53 


Cii 


1335.31 


Siiv+Oiv] 


1399.80 



Wavelengths are from the SDSS Reference Line List 
(http:/ /www. sdss.org/dr6/algorithms/speclinefits. html) with 
Sill and Cm* fromiVandcn Berk ct al. (2001) and Nv doublet 
from the Atomic Line List 

(http://www.pa.uky.edu/~peter/atomic/) compiled by Peter 
van Hoof. 



the residuals. Pixels included in the fits are shown as colored 
points (red for the continuum fits and blue for the line-profile 
fits). The model spectrum (black solid line, almost entirely 
hidden by data points) is clearly a good fit to the data used 
in the fits, as can be seen visually and by the small reduced 
values. Vertical solid lines (green) indicate the nominal 
rest wavelengths (listed in Tabled of broad quasar emission 
lines. Ve rtical dashed l i nes in dicate ISM absorption features 
listed in IVerner et af] (|l994D . Some of these lines are also 
visible in the residuals, and these and other pixels excluded 
from the fits automatically by our absorption-feature detec- 
tion code are indicated in purple. The bottom-right panel 
shows sideways histograms of the residuals (blue points for 
the fine-profile fit, red for the continuum fit) with the ex- 
pected Normal distributions shown by the solid lines, con- 
firming that the distribution of the residuals matches the 
expected distribution fairly well. The best-fit values of the 
model parameters are printed on the graph. Table[3]lists the 
best-fit model parameters for all of the full-profile fits. The 
amplitudes of the Gaussian components are normalized to 
the continuum fiux at Lyman-oc (which is stated in the flux 
units of the spectra, erg s~^ cm~^ ^~^)- Also given are the 
reduced values and the numbers of degrees of freedom 
for the continuum and line-profile fits. 

Figure U plots the reduced value, /v, for each of 
our fits versus the mean signal-to-noise ratio for the pixels 
used in that fit. A large fraction of the fits (41/87 profile 
and 24/87) are formally unacceptable at the 99% signifi- 
cance level. The /ly values corresponding to this signifi- 
cance level (using the median number of degrees of freedom: 
183 for continuum and 318 for profile fits) are indicated in 
the figure by horizontal lines (at /v = 1.26 and 1.19 re- 
spectively). To the eye, our models generally look like good 
matches to the overall shape of the line profile. Examination 
of the residuals suggests that the formally poor fits are due 
primarily to small deviations at the ~ 5-10% level. This is 
more than adequate performance for our purposes, but it is 
interesting to note that the intrinsic quasar spectra are, at 
some level, more complicated than our models. 
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Table 3. Best-fit spectral parameters from fits to the full line profile. 



Complete table available in the electronic version. 



The first two columns give the NED name of the target, plus the HST instrument, grating, and aperture used to take the spectrum. 
(TLctO is the width of the narrow Lyman-a component (corrected for redshift using the NED value of z, see Table 1). OLctO is the 
amplitude of this component, normalized by the continuum flux Op. dlkO is the shift of the component center from the nominal 
wavelength calculated with the NED redshift. The next six columns give the same quantities for the broad Lyman- oc component (Lai), 
and the N V component, p is the continuum power law index, and ttp is the continuum flux at the nominal Lyman-oc central wavelength, 
in units of 10~^^erg s~^ cm~^ A~^. The last columns give the reduced value /v and the number of degrees of freedom v for the 

continuum and line-profile fits. 



profile fits 
continuum fits 



4.0 
3.5 
3.0 
2.5 
-2.0 
1.5 
1.0 
0.5 
0.0 

5 10 15 20 25 30 35 40 45 50 
mean signal-to-noise ratio 

Figure 4. Reduced values versus mean signal-to-noise ratio 
for the line profile (-I-) and continuum (x) fits. Both values are 
calculated using only the pixels included in each fit. The x^/^^ 
values corresponding to the 99% significance level (using the me- 
dian number of degrees of freedom: 183 for continuum and 318 
for profile fits) are indicated in the figure by horizontal lines (at 
'X^ /i' = 1.26 and 1.19 respectively). 



3.2 Shifts in Emission Components 

Since our primary concern in this paper is the effect of er- 
rors in the extrapolation of intrinsic quasar spectra from 
the red side of the line to the blue side of the line, we pay 
particular attention to any shifts or asymmetries in the line 
profiles that could introduce a systematic bias in the extrap- 
olation. Obviously it is easiest to extrapolate the line profiles 
from the red side to the blue side if they are symmetric and 
consistently centered at a well-defined wavelength. Figure [5] 
shows the distributions of shifts of emission component cen- 
ters from the fits to all of the spectra. The shifts in the line 
center wavelength ^ relative to the nominal central wave- 
length (1 + 2)Ao has been represented as a velocity 



'^shift 



M — (1 + 2)Ao 
{l + z)Xo 



(7) 



where z is the NED redshift. These shifts are often hundreds 
or even thousands of km/s, and show little correlation be- 
tween components. Caution must be exercised in interpret- 
ing these velocity values, since our line-profile model is phe- 
nomenological rather than physical. Nonetheless, many line 
profiles clearly show distinct emission components shifted 
by large velocities relative to their nominal wavelengths and 
to each other. 
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Figure 5. Distribution of the shifts of three emission-line com- 
ponents relative to the systemic redshift. Median values are indi- 
cated by vertical dotted lines. The median shifts are: —303 km/s 
for the narrow component, —413 km/s for the broad component, 
and —45 km/s for the Nv component. A handful of extreme val- 
ues fell outside of the plotted range. 



The nominal central wavelengths are calculated using 
the redshifts from NED, which were mostly determined from 
low-ionization metal lines. It is well known that there is of- 
ten a velocity offset between high-ionization lines and low- 
ionization lines in quasar sp ectra (see discussion and ref- 
erences in IShang et al.l [2007^ ■ There have been relatively 
few studies of shifts in the Lyman-a and Nv lines, in 
part because of the difficulty of separating the components. 
IShang et al.l \20Q± measured velocity shifts of the peak of 
the Lyman-a line relative to the [O ill] 5006.8 A line for a 
sample of nearby quasars (2 < 0.4). This should roughly 
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correspond to our narrow component shifts, since this com- 
ponent dominates in the peak of the overall profile (see Fig. 
[2|. To check this we compared the shifts of line profile peaks 
versus the best-fit narrow component shift in our sample 
and found that (with very few exceptions) they indeed cor- 
respond closely. Their sample has a mean shift of —90 km/s 
with standard deviation of 250 km/s. They also measure the 
asymmetry of the total line profile, finding that the Lyman-a 
line almost always has an excess in the blue wing, indicating 
that the broad component tends to be shifted blueward of 
the narrow component. Their measurements of the shifts of 
other lines suggest that lines arising from higher ionization 
states are shifted by larger amounts. Of the lines they mea- 
sured, they find that the Lyman- oc shift is well correlated 
onl y with the Civ line at 1549.48 A. 

iMcIntosh et all l| 19991 ) on the other hand, found a mean 
velocity shift of —550 km/s between the Lyman-a line and 
the [O III] lines for a sample of quasars at 2.0 ^ z ^ 2.5. It is 
not clear why their distribution is so different, but it may be 
due to sampling a different population of higher-luminosity 
quasars (the luminosity limit was V 18 versus B < 23). 

There is no consensus on a physical model explaining 
these shifts, or indeed on a physical model of quasar broad- 
line and narrow- line regions (BLR and NLR). It has been 
suggested that the shifts of certain lines may depend on 
orientation (e.g. [Richards et al. 1 12002") and that there is ev- 
idence for a ffattened or disc-shaped BLR, with the high- 
ionizatio n lines being emitt ed cl oser to the central b lack 
hole (e.g. iDecarli et al.ll2008l). SeelMarziani et all l|2008l ) for 
a review of BLR models7 |Popovic et al.l ( 20041 ) suggest that 
there are two components of the NLR, which could explain 
the blue shift of the Lyman-a narrow component relative to 
the [O III] lines. In turn, it has been found that the [O ill] 
line i s itself often blueshifted relative to H-P fZamanov et al.' 
|2002| ) and the [On], [Nil], and [Sll] lines (Boroson 2005). 

Regardless of the physical origins of these shifts, we 
will have understand their effects on the flux extrapolation 
in order to avoid biasing the IGM measurements from high- 
redshift spectra. As we discuss in 52] below, these shifts do 
indeed make it difficult to accurately model the intrinsic 
spectra of high-redshift quasars. In H6.3I we attempt to cor- 
rect for the average shifts, but this relies on our knowledge of 
the intrinsic spectra of low-redshift quasars. Without a clear 
understanding of the physical origins of these shifts, any at- 
tempt to apply a similar correction to the observed popula- 
tion of high-redshift quasars would be highly uncertain. In 
particular, we worry that the shifts may be correlated with 
redshift, or with quasar parameters such as luminosity, so 
that the low-redshift sample is not necessarily representa- 
tive of the high-redshift population. However, in |S]we show 
that by careful choice of the wavelength range used in our 
analysis, we reduce the bias induced by these shifts, and in 
H6.4I we argue that the sign of the bias actually strengthens 
current constraints on the IGM neutral fraction. 



3.3 Intrinsic asymmetry of the Lyman-a emission 

Another way to evaluate the asymmetry of the emission 
components (in addition to the shifts of the best-fit-model 
emission components) is to compare the amount of fiux on 
the red and blue sides of the line. Unfortunately, the N v line 
makes it difficult to do this in a model-independent way, so 
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Figure 6. Distributions of ratio of red-side to blue-side Lyman- 
oc flux. In the top panel the flux is split at (1 + z) 1215.67 A in 
the observed frame. In the bottom panel the flux is split at the 
wavelength of the peak of the line. Median values (indicated by 
vertical dotted lines) are 0.73 in the top panel and 1.02 in the 
bottom. Three extreme values between 2.5 and 7 fell outside of 
the range plotted in the top panel. 



we rely on our line-profile fits to distinguish between the 
Lyman-a fiux and the Nv flux. In FigurelHlwe plot the his- 
togram of the total flux on the red side of the Lyman-a line 
to the total on the blue side. The fiux used is the sum of the 
narrow and broad Lyman-a components from the best-fit 
model (and therefore excludes the continuum and N v com- 
ponents) . A value less than one indicates an overall blueshift 
of the flux. In the top panel we have divided the flux at the 
nominal Lyman-a wavelength of Ao — 1215.67 A. As we 
would expect from the biased distribution of velocity shifts 
in Figure [5] there is a systematic excess of fiux on the blue 
side of 1215.67 A. 

This invites the question of whether the Lyman-a pro- 
files are symmetric about some other axis, or are they intrin- 
sically asymmetric? The peak of the line should define the 
axis of symmetry for a symmetric line profile. The bottom 
panel of Figure [6] shows the distribution of red-side to blue- 
side fiux split at the wavelength of the peak of the overall 
line profile. The peak was calculated from the observed spec- 
trum (not the model), by fitting a quadratic to the 10 pixels 
surrounding the maximum observed flux value and taking 
the peak wavelength of that function. As mentioned above, 
this peak corresponds quite closely with the center of the 
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narrow component in all but a few cases (which generally 
show highly unusual line morphology, often heavily blended 
with the N v line or having an absorption line near the peak) . 
There is a wide spread in this flux ratio distribution, but it 
is centered quite close to unity (as measured by both the 
peak of the histogram and the median value, indicated by 
the vertical dashed line). On average, the Lyman-a emission 
is symmetric about the peak of the line, but in individual 
spectra the broad component can be displaced considerably 
relative to the narrow component, producing an asymmet- 
ric total line proflle. In the next section we will explore the 
impact of these shifts and asymmetries on our ability to ac- 
curately extrapolate quasar spectra from the red side of the 
Lyman-a line to the blue side. 



4 EXTRAPOLATING TO OBTAIN THE BLUE 
FLUX FROM FITS TO THE RED SIDE 
ALONE 

In order to mimic fits to high-redshift quasar spectra (which 
are subject to strong IGM absorption on the blue side of the 
Lyman-a line) we next perform fits using only the spectrum 
on the red side of the line. Note that actual high-redshift 
spectra are subject to some additional absorption on the 
red side of the line due to the IGM damping wing. For sim- 
plicity, we do not include the contribution of the damping 
wing in our continuum fits. In principle, the continuum could 
always be self-consistently re-estimated for each model hy- 
pothesis for the foreground IGM absorption. In practice, the 
damping wing on the red side is nearly flat (constant with 
wavelength) and will have a negligible impact on the con- 
tinuum determination, unless the universe is nearly neutral; 
furthermore, the continuum is subdominant to the Lyman- 
a emission line at the wavelengths most important for our 
analysis. In order to make sure to avoid all of the resonant 
absorption, including from any foreground gas falling to- 
wards the quas ar that would absorb li g ht on t he red side of 
the line center l|Barkana fc Loebll2003t ). IMHOTI excluded not 
only the blue side of the line profile, but also the peak of the 
observed line from their line-profile fits. We therefore limit 
our red-side fits to A ^ 1220 A in the rest frame. 

Without the leverage of fiux data in the peak and blue 
wing of the line there is not enough information to simulta- 
neously constrain all three Gaussian parameters for all three 
components in most cases. We attempted to constrain the 
full set of line-profile parameters using red-side-only fits, but 
found that the Gaussian components were often poorly be- 
haved and the resulting best-flt models were too poorly con- 
strained to be u seful fo r extrapolating the fiux on the blue 
side of the line. IMHOTI avoided this problem by fixing the 
central wavelength of each component to the nominal wave- 
length of the line using the systematic redshift determined 
from metal lines. As we noted above, this is not a reliable 
assumption, but it is the most reasonable conservative ap- 
proach, and yielded better matches to the blue-side spectral 
shapes than fits with free line centers. Similarly, we found 
that the extrapolation of the continuum to the blue side of 
the line was more reliable if we fixed the power law index 
to —1.3 in the red-side-only fits (and also in any full-profile 
fits to spectra lacking coverage at Ao < 1165 A). Figure [3] 
(bottom plot) illustrates the type of error introduced by ex- 
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Figure 7. Fractional flux excess in the red-side-only fits versus 
wavelength. The excess is calculated by comparing the red-side- 
only best-fit model (i^ext) to the full-profile fit (Ffun). The top 
panel shows the models fit to the full profiles of all 87 spectra 
(light grey) and the median (dashed) and mean and standard 
deviation (solid) of the set of model spectra. The bottom panel 
shows the fractional difference between the models fit to the red 
side only and the models flt to the full profile for all 87 spec- 
tra (light grey), and the median (dashed) and mean and stan- 
dard deviation (solid) of the fractional flux difference. The verti- 
cal solid line (green) indicates the nominal central wavelength of 
the Lyman-a line. The vertical dashed line (blue) at 1220 A indi- 
cates the blue edge of the red-side-only line profile fit region. The 
vertical dotted lines at 1199 A and 1210 A (red) demarcate the 
typical analysis region for the high-redshift IGM measurements. 
Note that within this wavelength range the extrapolation is more 
accurate at shorter wavelengths. 
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Figure 8. Mean fractional flux excess versus red/blue flux ratio 
of the Lyman-cc emission about 1215.67 A. The fractional flux 
excess (see Fig. [7]l is averaged over the region between 1199 A 
and 1210 A. The distribution of flux ratios is shown in the top 
panel of Figure |6] 



trapolating the flux using flxed line centers. In this case the 
observed line proflle is shifted blueward by ~ 600 km/s. The 
amplitudes of the Gaussians in the red-side fit with fixed cen- 
ters are therefore depressed, and the flux is underestimated 
on the blue side of the line. Table |3] lists the best-fit model 
parameters for all of the red-side-only fits (see description 
of Table [31). 
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Table 4. Best- fit spectral parameters from fits to the red side only of the Lyman- a line. 



Complete table available in the electronic version. 



The first two columns give the NED name of the target, plus the HST instrument, grating, and aperture used to take the spectrum. 
(TLctO is the width of the narrow Lyman-a component (corrected for redshift using the NED value of z, see Table 1). ai^^cO is the 
amplitude of this component, normalized by the continuum flux Op. dlkO is the shift of the component center from the nominal 
wavelength calculated with the NED redshift. The next six columns give the same quantities for the broad Lyman- oc component (Lai), 
and the N V component, p is the continuum power law index, and ttp is the continuum flux at the nominal Lyman-oc central wavelength, 
in units of 10~^^erg s~^ cm~^ A~^. The last columns give the reduced value x^/'^ the number of degrees of freedom u for the 

continuum and line-profile fits. 



Figure [7] shows the differences between full-profile and 
red-side-only spectral fits for all 87 spectra. The top panel 
shows the models fit to the full profiles of each spectrum 
together with the median, mean, and standard deviation of 
the set of model spectra. The vertical solid line indicates the 
nominal central wavelength of the Lyman-a line. The mean 
and median spectra clearly peak a few A blueward of the 
nominal rest wavelength. The bottom panel shows the frac- 
tional difference between the models fit to the red side only 
and the models fit to the full profile for each spectrum, to- 
gether with the median, mean, and standard deviation of the 
fractional fiux difference. The vertical dashed line at 1220 A 
indicates the blue edge of the red-side-only line profile fit 
region. The agreement between the fits is fairly good on the 
red side of this limit, as we would expect, since both fits 
are using the same data here. At shorter wavelengths not 
only are the differences larger, but there is a bias. The spec- 
tra extrapolated from a model fit to the red side only of the 
Lyman-a. Ime tend to systematically underestimate the true 
spectra. 

Both the bias and spread of the flux extrapolation er- 
rors vary widely with wavelength blueward of the 1220 A 
lower limit on the red-side-only spectral fits. The core of the 
Lyman-a line is quite poorly predicted from the red-side fits. 
This is partially because the 1220 A lower limit excludes the 
region where the narrow component dominates, so this com- 
ponent is largely unconstrained in these fits, which is why 
there is such a large variance in the extrapolation error near 
line center. The main reason the bias is largest close to the 
line center (but slightly blueward) is the fact that the line 
centers are fixed in the red-side fits, while the real lines (and 
thus the full-profile best-fit models) have emission compo- 
nents that are often shifted by many A (as we saw in Fig. [5]), 
often to shorter wavelengths. The fixed-center Gaussians in 
the red-side models can match the red fiank of the emission- 
line fairly well, but once the line starts to peak and turn 
over somewhere other than 1216 A, the quality of the fit 
has to decline. Specifically, if the real peak of the line is at 
Ao < 1216 A, then a Gaussian profile fit to the red side of 
the line with a center fixed at 1216 A will tend to under- 
estimate the flux on the blue side of the peak (as in Fig. 
[3]). This explains why the bias (as measured by the median 
error spectrum) is close to zero between 1220 and 1216 A, 
but is largest just blueward of 1216 A where most of the 
observed lines peak. The effect is exaggerated by the fact 
the line proflle is most dominant over the continuum at its 
peak. The bias is smaller farther to the red where the con- 
tinuum becomes more and more dominant, indicating that 



the flxed powerlaw index of —1.3 is a reasonable mean value 
for this sample, though individual spectra still under- and 
over-estimate the flux by large fractions. The slight positive 
bias at Ao < 1195 A is due to the broad Cm emission line 
at 1175.7 A. 

The systematic underestimate of the flux by the red- 
side-only flts will have important consequences for our abil- 
ity to measure the IGM neutral fraction. The vertical dotted 
lines at 1199 A and 1210 A (red) indicate the typical anal- 
ysis region used for the high-redshift IGM measurements 
(see 352} This IGM analysis range avoids the core of the 
Lyman-a line, where the extrapolation errors are largest 
but, nonetheless, on average the red-side-only fits underesti- 
mate the fiux over the entire analysis region. Figure |5] shows 
the relationship between the mean fractional flux excess in 
the analysis region and the red/blue flux ratio of the Lyman- 
a emission about 1215.67 A. As we expected, red-side-only 
fits to spectra with blue shifted Lyman-a fiux tend to under- 
estimate the blue-side fiux, and vice- versa. In ^we explore 
the possibility of restricting the IGM analysis to < 1205 A 
to reduce the bias, but this obviously also reduces the num- 
ber of pixels available to analyze. In tj6.3l we also attempt to 
improve on our naive choice of 1216 A for the Lyman-a line 
centers with a crude correction for the mean blueshift of the 
line components, though we caution again that this requires 
a priori knowledge of (or inferences about) the distribution 
of line shifts in the high-redshift quasar population. 

Our primary concern in this paper is to understand how 
errors in fiux extrapolation like those described above (par- 
ticularly those caused by the asymmetries in the line pro- 
files) propagate to errors in the final determination of the 
IGM neutral fraction. In the next section we describe how 
our line-profile fits are used to simulate high-redshift damp- 
ing wing measurements in order to evaluate this impact. 



5 SIMULATED HIGH-Z DAMPING WING 
MEASUREMENTS 

5.1 Generating simulated high-z quasar spectra 

In order to understand how fiux extrapolation errors will 
affect our ability to measure the IGM neutral fraction us- 
ing the IGM damping wing, we must simulate high-redshift 

^ We will be careful throughout this paper to distinguish the 
optical-depth fits used to measure IGM properties from high- 
redshift spectra from the line-profile fits used to study or extrap- 
olate the intrinsic spectrum of a quasar. 



© 2009 RAS, MNRAS 000, 000-000 



10 R. H. Kramer and Z. Haiman 



quasar spectra. We begin with the full-profile spectral model 
fits described in ^ Each best-fit model is used as the in- 
trinsic spectrum of a hypothetical high-z object. We use the 
line-profile models instead of the actual observed spectra of 
the low-z quasars so that we can match the expected noise 
and instrumental resolution of a high-z spectrum. In order 
to make sure that our fiux models capture all of the fea- 
tures relevant to the IGM neutral fraction determination, 
we also carried out an analysis using the observed spectra 
(with flagged absorption features, but not emission features, 
replaced by the model flux values) as the intrinsic spectra 
of the simulated high-z quasars. We found that the results 
did not differ significantly from our analysis using the model 
line profiles as the intrinsic spectra. 

We then simulate absorption by neutral hydrogen in the 
IGM (and instrumental effects) to generate mock absorption 
spectra on which we can test our techniques for recovering 
the IGM parameters. We model the density field along lines 
of sight through the high- z IGM using the probability distri- 
bution function (PDF) of lMiralda-Escude. Haehnelt fc ReesI 
l|200d ): 



Pv(A)dA = AA-'^ exp 



-(A- 



-2/3 



cf 



2 (25o/3)2 



dA, 



(8) 



where A = (p/po) is the density in the IGM normalized by 
the mean density, and we use the z = Q values A — 0.864, 
So = 1.09, 13 = 2.5, and C = 0.880 for aU of the constants. 
Our conclusions should be fairly insensitive to small changes 
in these values, or even in the form of the PDF. The crucial 
assumption we make is that the density PDF is fairly well 
known. For actual measurements, uncertainties in the form 
of the PDF would be an additional source of error that we 
do not address here. 

To construct density profiles A(r) along the line of sight, 
we generated independent random values of the density from 
Equation |8] for each patch of IGM. In order to roughly ac- 
count fOT_£h£toionization- induced smoothing in the density- 
field (|Gnedin|[2OO0l ). we set a minimum co-moving size of 
1 Mpc on IGM density patches. All pixels falling within 
the same isodensity patch receive the same IGM density 
value. We do this in order to avoid having an unrealistically 
high number of independent data points in our simulated 
high-resolution spectra. In our mock absorption spectra we 
have roughly 2 pixels per isodensity patch, and roughly 20 
patches are included in each damping-wing analysis. The 
details of the IGM correlations should not matter a great 
deal as long as they are short-range (a small fraction of the 
size of the ionized region) , especially since the instrumental 
FWHM we use is comparable to the isodensity patch size. 
We have tested this assumption by repeating the analysis 
with isodensity patches of 0.25 Mpc co-moving. The scatter 
is reduced slightly due to the larger number of independent 
samples of the optical depth profile, but our overall conclu- 
sions are unaffected. 

Next we calculate the neutral hydrogen density assum- 
ing ionization equilibrium. Three important parameters en- 
ter into the calculation at this stage. The ionized region size 
is -Rhii- Outside of this region, the ionizing fiux is uniform, 
and is parametrized by xigm, the equilibrium neutral frac- 
tion at mean IGM density. Inside the ionized region, there is 
additional flux from an ionizing point source (the quasar), 
parametrized by Xici, which is the equilibrium neutral frac- 



tion (including both background and quasar flux) of mean- 
density gas at a reference point rrcf = 35 Mpc comoving 
away from the quasar. 

Physically, the radius i?Hii is set by the quasar's ionizing 
photon luminosity and its age. The higher the rate of ioniz- 
ing photon production and the longer the quasar has been 
shining, the larger its ionized region will be (at least until 
the recombination rate inside the sphere balances the rate 
of photon emission, which occurs only at times much l onger 
than the expected quasar lifetime [Cen fc Haimanll2000l ) . The 
neutral fraction at a given point inside the ionization front 
Xrei scales with the luminosity of the quasar. 

We define a normalized ionization rate 



7 



(9) 



where F is the number of ionizations per unit time per neu- 
tral hydrogen atom, riH is the total (neutral and ionized) 
hydrogen density, and as = 2.59 x 10~^^ cm^ s~^ is the 
case B recombination coefficient at T = 10* K. The nor- 
malized background ionization rate required to maintain an 
equilibrium neutral fraction of xigm is then 



(1 



7BG = . (10) 

a;iGM 

In order to maintain a neutral fraction aj^ef at the reference 
distance, the additional normalized ionization rate due to 
the quasar fiux must be 



7rcf 



(1 - X,,cf 
^rcf 



7BG. 



(11) 



Therefore the total ionization rate as a function of luminos- 
ity distance from the quasar r (inside the ionized region), 
is 



l{r) = 



nH(rrcf) 



7rcf 



+ 7bg 



(12) 



where the ratio of the mean IGM hydrogen density at Viai 
to the local hydrogen density at r is given by 



A(r)-^ 



1 + z(rrc() 



l-|-z(r- 



(13) 



This factor simply compensates for the fact that 7rcf and 
7BG are defined with reference to the mean IGM density at 
Trcf. Outside the ionized region (r > -Rhii), the quasar con- 
tribution is zero {'yid ~ 0), so the ionization rate is simply 



7('^) = ri-7BG- 



(14) 



The equilibrium neutral fraction at distance r from the 
quasar is 



x{r) - 
with 



-b - - 4 



-2-7(r). 



(15) 



(16) 



Finally, the optica l depth due to resonant absorption is 
(lMiralda-Escudelll998l ) 



rr(A) = To x{r) 



0.03 



Ho{l + z 



,3/2 



Hiz) 



1 + z 



3/2 



(17) 
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where tq = 2.1 x 10^, r and z are the distance and red- 
shift corresponding to wavelength A, H{z) is the Hubble 
constant at redshift z (see, e.g. iHoed Il999h . and ho = 
Ho/IQQ km s^^ Mpc'^ We use 0.^ = 0.0462, Q.^ = 0.233, 
Oa = 0.721, =^c. + ^6, fife = 0, fcp = 0.701, and the 
hydrogen fraction X = 1 - 0.240 (|Komatsu et al.ll2008l ). 

Inside the transmission window in the quasar spectrum 
(corresponding to the ionized region) , at wavelength A there 
is both resonant absorption by hydrogen at z = A/Aq — 1, 
and absorption by the extended damping wing ra of hydro- 
gen at lower redshifts in the Gunn-Peterson trough. We cal- 
culate Td using an analyti cal expression fo r the r ed wing of 
the GP trough derived by iMiralda-Escudi l| 19981 . equations 
11 and 12). 

The total optical depth inside the transmission window 
is the sum of the resonant contribution from the residual 
neutral hydrogen inside the ionized region, and the damping 
wing absorption from the hydrogen in the IGM, 



r(A) = r.(A)-f rd(A), 



and the transmitted flux is 



Fobs(A) = Fi(A)e-^(^\ 



(18) 



(19) 
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A/ A (observed wavelength) 



8820 



where -F'i(A) is the intrinsic spectrum of the quasar. 

Finally, we smooth the mock spectrum with an instru- 
mental line spread function and add noise. We use a spec- 
tral resolution of ii = 2000. The spectrum is smoothed 
by Gaussian convolution with a kernel of width ctlsf ~ 
Xo{l+z)/ {2.355 R). The pixel size is AA = \o{l+z)/{2.5 R). 
We normalize the spectral models to have the same intrinsic 
continuum flux Fo at Aq. Our noise model, chose n by hand 
to m imic a typical spectrum of a high-z quasar l|Fan et al.l 
I2OO6I ). is a combination of a shot-noise term and a constant 
background term, giving flux variance (in normalized units) 
a|.(A) = e%F{X) + eic with ep = 10"^ and cbg = 10"^. 
This rough, order of magnitude estimate of the noise level 
suffices, since, in practice, instrumental noise has very little 
impact on the fits due to the much larger fluctuations in the 
observed flux caused by density variations in the IGM. 

Figure |9] shows the density, optical depth, and flux pro- 
files for a model high-redshift quasar. The effects of both the 
uniform ionizing background and the quasar ffux are evident 
in the deficit in the neutral hydrogen density relative to the 
total hydrogen density. The optical depth is much lower in- 
side the ionized region, allowing the quasar fiux to reach us. 
With these parameters, the resonant absorption dominates 
the opacity in high-density regions, while the damping wing 
dominates in low-density regions, setting an effective opti- 
cal depth ffoor. The plot of the spectrum (in the bottom 
panel) demonstrates that the transmission window is super- 
imposed on the blue wing of the quasar's intrinsic Lyman- 
a line. The flux is entirely absorbed in the Gunn-Peterson 
trough (at A < 8730 A), but a large fraction is transmitted in 
the wavelength range corresponding to ionized region. The 
Keck spectrum of the 2 — 6.28 quasar J1030-I-054 is qualita- 
tively quite similar, differing mostly in that the transmission 
window is slightly smaller and instrumental broadening has 
smoothed the flux variations somewhat. 



Figure 9. Generating a mock absorption spectrum: density (top), 
optical depth (middle), and flux (bottom) profiles along the line 
of sight to a hypothetical z = 6.28 quasar. The top panel shows 
the total (phi black) and neutral (phIi blue dotted) hydrogen 
density normalized to the mean density of the IGM. The model 
parameters are -Rhii = 40.5 Mpc, x^cf = 10~^'^, and xjqm = 0.1. 
The middle panel shows the damping wing and resonant optical 
depths, and their sum. The extreme optical depths in the Gunn- 
Peterson trough (at A < 8730 A) are above the range of this plot. 
The bottom panel shows a model intrinsic spectrum (dotted) and 
mock absorption spectrum (solid black). A Keck sp ectrum of the 
z = 6.28 quasar J1030-I-054 (grey) is also shown llBecker et al.l 
l200j) . Note that the model was not fit to this spectrum, it is 
shown merely for comparison. 



5.2 Analysis of the mock spectra 

In order to test the impact of intrinsic Lyman-a line shape 
variations on the recovery of IGM and H 11 region parame- 
ters from a high-z quas ar spec trum, we implement a version 
of the method used bv IMHOtI The first step is to extrapo- 
late the flux by fitting a spectral model to the red side of the 
Lyman-a line, as described in H2.3I We then calculate the 
optical depth by comparing the extrapolated spectrum to 
the mock absorption spectrum and solving equation [19] for 
the optical depth, substituting the extrapolated fiux Fgt for 
the intrinsic fiux and the observed fiux Fobs for the trans- 
mitted fiux: 

robs = In f (20) 

V ^obs / 

An overestimate of the intrinsic fiux results in an overesti- 
mate of the optical depth, and vice- versa. Figure [10] illus- 
trates this effect. The top panel shows the intrinsic spec- 
trum of a mock high-redshift quasar (based on the observed 
spectrum of 3C 273), the mock spectrum after absorption 
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Figure 10. Error in the mock "observed" optical depth profile; 
Flux (top) and optical depth (bottom) on the blue side of the 
Lyman- <x line. The top panel shows the intrinsic (unabsorbed) 
flux, flux extrapolated from the red-side fit, and mock absorption 
spectrum for a model quasar at z = 6.28 (based on the observed 
spectrum of 3C 273). The bottom panel shows the input model 
optical depth (used to generate the spectrum), and the output op- 
tical depth inferred by comparing the mock absorption spectrum 
and the extrapolated fiux. The mismatch between these lines is 
caused by the mismatch between the intrinsic spectrum of the 
quasar, and the spectrum extrapolated from a fit to the red side 
of the Lyman- oc line. The vertical dotted lines (at 1199 A and 
1210 A in the quasar rest frame) indicate the typical analysis 
region for the high-redshift IGM measurements. 



Once we have an optical depth profile, we need to fit a 
model to it to recover our parameters of interest. Because the 
density fiuctuations in the IGM cause large flux variations, 
this is not a trivial task. IMHOTI solved the problem by group- 
ing the flux values into three wavelength bins, then using 
the Kolmogorov-Smirnov (K-S) test to compare the distri- 
bution of flux values in each bin with a distribution derived 
from simulations. The product of the three K-S probabili- 
ties p then served as the relative likelihood estimate for that 
combination of parameters. They then compared observed 
quasars to a grid of models using this technique, selecting 
the model with the maximum p- value as the best flt. Optical 
depth values above about 6 are not measurable from a real 
spectrum because the flux drops below the detection limit. 
We therefore set any r > 6 to Tmax ~ 6. Note that the K-S 
test is not strictly valid with censored data, and assumes all 
values are independent, neither of which holds in this case. 
Therefore care must be taken in interpreting the p-values. 
We rely on Monte-Carlo simulations to avoid depending on 
the p-values to determine confidence intervals. 



We will distinguish between our grid of "model" opti- 
cal depth profiles, used to make the canonical distributions 
for the K-S test, and our set of "mock" profiles, which we 
are analyzing as if they were observed high-z quasars, even 
though they are all generated using the same techniques 
outlined above. The only difference is that the model opti- 
cal depth profiles are generated without reference to a spec- 
trum, while the mock profiles use the extrapolated spectra, 
and thus incorporate any errors in the flux extrapolation. 



At each point in our parameter-space grid we gener- 
ate 100 model profiles, and we use each real low-z quasar 
spectrum to generate 200 mock optical depth profile ob- 
servations using a single set of input parameters, making 
a total of 86 x 200 = 17200 mock profiles. We then run 
the optical depth analysis on each mock spectrum, and 
track the distribution of the recovered best-fit parameters 
{Rmi, iref , 2;igm). 



in the IGM, and the flux extrapolated from a fit to only 
the red side of the Lyman-oc line. It can be seen that the 
fiux extrapolated from red-side-only fit in this case (as is 
typical) underestimates the intrinsic spectrum of the model 
quasar. When the extrapolated fiux is used to calculate the 
optical depth profile (shown in the bottom panel), the er- 
ror in the fiux extrapolation results in errors in the inferred 
optical depth profile. Within the analysis region typically 
used for high-redshift IGM measurments (1199-1210 A in 
the rest frame), there is a systematic underestimate of the 
optical depth which is largest at longer wavelengths, where 
the extrapolation error is largest. Note that there are re- 
gions of the spectrum where the absorbed flux exceeds the 
extrapolated flux by signiflcant fraction. This is outside of 
the line-profile fit region as we have defined it here for high- 
z fits (Ao > 1220 A), but in principle observed absorbed 
fiux values on the blue side of the line could be included 
in the fits as lower limits on the intrinsic flux, which would 
decrease the underestimate of the flux somewhat. 



By default, the maximum wavelength of the analysis 
range is Amax = 1210 A in the rest frame. IMHOTI excluded 
pixels closer to the Lyman- a line center to avoid the bi- 
ased environment close to the quasar. This has the added 
advantage of avoiding the most problematic area for flux 
extrapolation — the narrow-component-dominated core of 
the line (since the narrow component is less well constrained 
in the line proflle flts and is affected more by shifts in the 
line center). The minimum wavelength is chosen by flnd- 
ing the bluest pixel with r ^ 6, then extending the overall 
wavelength range blueward by 5%. The inclusion of these 
extra "dark" pixels is important for the statistical com- 
parison with mock spectra, since density variations change 
the exact wavelength at which the optical depth exceeds 
T — 6 for different spectra with the same value of Rmi- 
With Rmi = 40.5 Mpc, the typical range is roughly 1199- 
1210 A, divided evenly into three bins of about 15 pixels 
each. 
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6 THE IMPACT ON PARAMETERS 

ESTIMATED FROM HIGH-Z SPECTRA 

6.1 Parameter Recovery with Known Intrinsic 
Spectra 

In order to test recovery of IGM parameters without ex- 
trapolation errors, as a sanity check we first calculate a set 
of mock optical depth profiles using the "correct" intrin- 
sic model spectra in Equation [20] instead of the extrapo- 
lated spectra Fm . We then analyze the profiles as described 
above. We used only 4 different intrinsic spectral models 
(chosen at random from our set of full-profile fits) for this 
purpose, since the shape of the underlying spectrum has lit- 
tle impact on the parameter recovery if it is perfectly known. 
Figure [TT] shows the distribution of the best-fit values from 
the 800 mock optical-depth fits in our 3-dimensional param- 
eter space (200 random IGM density profiles times 4 intrin- 
sic spectra). The input parameters are -Rhii = 40.5 Mpc; 
Xrei = 10~^'^; iiGM ~ 0.1. The shading of each square indi- 
cates the number of best-fit values that fell within that cube 
in parameter space (darker shading indicates larger num- 
bers). The top group of graphs show slices through our pa- 
rameter space in planes of constant -Rhii. The middle group 
shows slices of constant x^^f- The bottom group shows slices 
of constant a;iGM. The regions of darker shading trace out 
degeneracies between all of the parameters, which are eas- 
ily understood. Increasing either a;iGM or a^rof increases the 
optical depth, so these parameters are anti-correlated (top 
group of graphs). When the best-fit value of the IGM neu- 
tral fraction is overestimated, the internal neutral fraction 
tends to be underestimated. On the other hand, increasing 
-Rhii moves the damping wing away from a given wavelength 
coordinate and decreases the optical depth, so the ionized 
region radius is positively correlated with the other two pa- 
rameters (middle and bottom groups of graphs). 

The density fluctuations along the line of sight induce 
a wide scatter in the recovered values, which follows the de- 
generacy contours we described above. Even so, the distribu- 
tion of best-fit parameters in Figure [TT] is peaked at exactly 
the coordinates of the input parameters. The top panel of 
Figure [T^ shows the marginalized distribution of recovered 
2;iGM values. Black lines show the cumulative distribution 
function (CDF) and its complement. Where they cross (at 
50%) is the median value. While the marginalized distribu- 
tion peaks slightly above the input value, the median recov- 
ers the input value (indicated by the vertical dashed line) 
exactly. The secondary peak at log(a;iGM) = —2.0 represents 
the integral of the long negative tail of the distribution which 
is truncated by our finite logarithmic grid (which we must 
keep small for computational speed). 

Because of our coarse parameter-space grid it is diffi- 
cult to quote conventional confidence limits, but we can see 
from Figure [12] that 80% of the marginalized neutral frac- 
tion distribution is in the interval log(xiGM) ~ —1.0 ± 0.6, 
90% of the distribution is at xigm ^ 0.22, and 100% at 
a;iGM ^ 0.46 (none of the 800 best-fit values implied a neu- 
tral IGM). Similarly, 88% of the internal reference neutral 
fraction values (distribution not shown) are in the interval 
Iog(a;ref ) = —5.5 ± 0.2, and 93% of the HII region radii are 
within -Rhii = 40.5 ± 2 Mpc (the distributions of both of 
these parameters peak and have their median at the input 
value). These represent the approximate confidence limits 
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Figure 11. Parameter recovery map using perfectly known in- 
trinsic spectra. Each panel is a slice through parameter space. 
The shading of each square indicates the number of best-fit val- 
ues that fell within that cube in parameter space (darker indi- 
cates larger numbers). The input parameters (-Rhii = 40.5 Mpc, 
^ref = 10~^'^, Xigm = 0.1) are indicated with a white point. The 
top group of graphs show slices of constant H II region radius. The 
middle group shows slices of constant internal reference neutral 
fraction. The bottom group shows slices of constant IGM neutral 
fraction. Note that the distribution peaks at the correct values, 
however there is a large scatter induced by density fluctuations 
along the line of sight. 
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Figure 12. Marginalized distribution of estimated xiqm values 
with perfectly known (top) and extrapolated (bottom) intrinsic 
spectra. The histogram (axis scale on the left) shows the marginal- 
ized distribution of recovered xjqm values. Black lines show the 
cumulative distribution function (CDF), i.e. the fraction of re- 
covered values at a; < xiqm (increasing curve, axis scale on the 
right) and its complement (decreasing curve). Input parameters 
are -Rhii = 40.5 Mpc, x^^f = 10~^-^, xjqm = 0.1. The vertical 
dashed line indicates the input value. The median value occurs 
where the CDF and its complement cross. 



that could be placed on these parameters from a single high- 
z quasar spectrum in the limit of perfect knowledge of the 
intrinsic quasar spectrum, the density distribution in the 
IGM, and the radial profile of the background ionizing fiux. 
We find similar results in simulations with xigm ~ 0.2 and 
known intrinsic spectra. For instance, 98% of the results are 
at a;iGM ^ 0.46 in that case. 

6.2 Parameter Recovery with Extrapolated 
Intrinsic Spectra 

In order to understand the effect that fiux extrapolation 
errors have on the recovered parameters, we tested IGM pa- 
rameter recovery using our more realistic mock optical depth 
profiles, calculated using the spectra extrapolated from the 
red side of the line profile. This mimics the process of obtain- 
ing such a profile from a high-z quasar spectrum, where the 
intrinsic flux is unknown. The distribution of 17200 best-fit 
values (86 observed spectra times 200 random density pro- 
files) from the optical depth analysis using these profiles is 
shown in Figure [TS] The same degeneracies as in Figure [TT1 



(with known spectra) are evident, but the peak of the dis- 
tribution has shifted to the minimum values of the internal 
and external neutral fraction, and to a lower value of the 
Hll region radius. Figure [121 compares the marginalized dis- 
tributions of recovered xigm values with known and with 
extrapolated intrinsic spectra. Both the median and peak of 
the distribution have moved to lower values, and the "sec- 
ondary" peak at a;iGM = —2.0, representing the integral of 
the tail of the distribution that would extend to lower values 
if not truncated by our finite logarithmic grid, now contains 
40% of the results. 

This bias in the fits toward underestimating the true 
neutral fraction is caused by the bias in the fiux extrapo- 
lation that we discussed in SJS] This can be seen in Figure 
1141 where we plot the recovered IGM parameters versus the 
red-to-blue fiux ratio about 1215.67 A. The shading shows 
the 2D distribution of all 17200 fit results. The points show 
the mean value for each of the 86 intrinsic spectral models 
(over 200 random density profiles). There are obvious corre- 
lations between the flux ratio and each of the fit parameters. 
Because there are more spectra with excess blue flux than 
excess red flux (Fig. [6} , and because the red-side-only fits 
tend to underestimate the blue-side flux of spectra with blue- 
shifted Lyman-cc lines (Figures [7] and |8]l, there are more red- 
side fits that underestimate the fiux than overestimate it. If 
the intrinsic fiux is underestimated, then the optical depth 
derived from it will also be underestimated. This tends to 
favor lower values of the internal and IGM neutral fractions 
in the optical depth fits, which makes sense, since both pa- 
rameters correlate positively with optical depth (though the 
wavelength-dependence of the bias must also play a role in 
how much each parameter is affected since the components 
of the optical depth have different profiles). It is a little more 
surprising that the H ll region radius shows the same sign in 
the correlation, since it has the opposite relationship with 
the optical depth. The correlation with the other parameters 
is apparently strong enough to outweigh the anti-correlation 
with optical depth. 

So far we have tested parameter recovery with the fairly 
high IGM neutral fraction of 0.1. In light of the current 
constraints on the IGM neutral fraction, however, it is in- 
teresting to explore the question: how likely is it that an 
ionized IGM will be mistaken for a neutral IGM? More 
specifically, what is the probability that the best fit values 
of Xigm ~ 1.0, 1.0, 0.2 for three quasars at z > 6.2 would 
be obtained (as in IMHOTI ) if the IGM were in fact highly 
ionized? To answer these questions we repeated our analysis 
with the input neutral fraction lowered to xigm = 10~^. 

Figure [15] shows the marginalized distributions of re- 
covered Xigm values with known and extrapolated intrinsic 
spectra. Even with known intrinsic spectra the median no 
longer reflects the input neutral fraction. With a highly- 
ionized IGM the damping wing is too weak to be reliably 
detected amid the noise of density fluctuations inside the 
ionized region. We can still place strong upper limits on 
the neutral fraction, however. With known intrinsic spec- 
tra (top panel), none of the best-fit values imply a neutral 
IGM, 94% of the recovered values were at log(a;iGM) ^ —0.5, 
and 61% at log(a;iGM) < —1.0. The added errors from using 
extrapolated spectra actually strengthen the upper limit at 
some confidence levels due to the bias toward underestimat- 
ing the neutral fraction. 96% of fits yield log(xiGM) —0.5, 
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and 85% of fits yield log(a;iGM) ^ — f.O. Tiie small amount 
of additional scatter to high neutral fractions is negligible. 
Only 0.3% of the fits yield a neutral IGM. 

Since the fits tend to perform worse close to the line 
center, we investigated the potential to reduce the bias by 
lowering the maximum wavelength of the pixels used in the 
optical depth fits. This cannot be taken too far, since there 
are already relatively few pixels used in these fits. Figure [161 
shows the results of setting the maximum wavelength of our 
analysis region to Amax ~ 1205 A rather than 1210 A. The 
fits (using extrapolated spectra) are dramatically improved. 
The bias is reduced, with the median substantially closer 
to the input value and the peak of the distribution at the 
input value. There is a substantial decrease in the scatter to- 
ward lower values (only 25% at log(a;iGM) ^ —2 versus 40%, 
and only a small increase in the scatter toward high values. 
Clearly, careful choice of the analysis region can reduce the 
impact of flux extrapolation errors. 

6.3 Correcting for Median Line Shifts 

Since we have shown that it is the mismatch between the 
(on average) blueshifted Lyman- a emission and the spectral 
models with line centers fixed at their nominal wavelength 
that is causing the bias in the optical depth fit parameters 
(most importantly the IGM neutral fraction xigm), we de- 
cided to attempt to correct for the mismatch by performing 
red-side-only line profile fits with centers offset by the me- 
dian shifts found in our full-profile fits. These fits were per- 
formed as described in 311 but with velocity offsets fixed at 
—303, —413, and —45 km/s for the narrow Lyman-a, broad 
Lyman- a, and Nv components. Figure [iTl (fractional flux 
excess versus wavelength) confirms that imposing a constant 
shift at the median value for each component largely elimi- 
nates the bias in the flux. As the flgure shows, the median 
flux excess is now close to zero everywhere. There is still a 
wide scatter, and the mean in fact shows a positive bias in 
the core of the line. The poor performance in the core is due 
to the difficulty of constraining the narrow component with 
the fit range restricted to Ao > 1220 A, as discussed in 331 

Figure [TSl shows the marginalized distribution of recov- 
ered xiGM values using the red-side-only fits with median 
velocity offsets to extrapolate the intrinsic spectrum. The 
overall distribution has shifted back toward higher values 
(closer to the correct input value), correcting the bias some- 
what, but the correction is smaller than we might have ex- 
pected. This crude correction for the median offsets of the 
emission components is insufficient to eliminate the bias. In- 
terestingly, the peaks and median values of the marginalized 
distributions for the If ll region radius and internal reference 
neutral fraction correspond to the input values, so this sim- 
ple correction has successfully eliminated the bias for these 
parameters, even though it did not do so for the IGM neu- 
tral fraction. It is unclear why this median correction did not 
eliminate the neutral fraction bias. The most likely expla- 
nation is that the neutral fraction is sensitive to the wave- 
length dependence of the fiux extrapolation errors in a way 
that differs from the other parameters. An additional con- 
cern (discussed earlier, in i]3.2p . is that the distribution of 
emission-component shifts in the high-redshift quasar pop- 
ulation may very well differ from the distribution sampled 
here. This could mean that the median values for the shifts 



obtained from this sample (or other low-z samples) would 
be biased relative to the high-z population. In SjZlwe discuss 
more sophisticated techniques for modeling and extrapolat- 
ing the intrinsic spectrum that should be more successful at 
eliminating the neutral fraction bias. 

6.4 Caveats 

While our overall conclusion that the bias is in the direction 
of low a;iGM values should be robust, some caveats apply to 
the specific numerical results discussed above: 

(i) These results assume that high-redshift quasars are 
drawn from the same population of intrinsic spectral shapes 
as our low-redshift sample. This is probably not a valid 
assumption in detail, since quasar spectra are known to 
show luminosity-dependent effects, but it may be a reason- 
able conservative a pproximation for our purposes. Studies of 
the Baldwin effect l|Baldwinll 19771 : lEspev fc Andreadis|[l999l : 
IShieldsll2007l ) , indicate that the strength of the Lyman-a line 
will decrease relative to the both the Nv line and the con- 
tinuum in more luminous quasars. A quasar 3 orders of mag- 
nitude more luminous than the mean for our sample would 
have a Nv equivalent width ro ughly 50% larger assuming 
a Baldwin effect slope of ~ 0.2 (|Espev fc A ndrcadis 1999), 
though other measurements indica te a lower value for the 
slope (0.0 from lDietrich et al.ll2002t ). Similarly, the Lyman-a 
line would be expected to have an equivalent width roughly 
30% lower for the same increase in luminosity. This means 
that the Nv line would be up to 2.2 times stronger relative 
to the Lyman-a line, and the N v line would still contribute 
no more than a few percent of the flux in the analysis re- 
gion. Problematic blends with other neighboring lines are 
similarly unlikely. For instance, the equivalent width of the 
Sill line at 1260 A shows little dependence on luminosity 
(|Dietrich et al.|[2002l ). so it should still be reliably excluded 
from our fits. Direct contamination of our analysis region 
(1199-1210 A), would therefore be negligible, even for a rel- 
atively enhanced N v line, but one concern is that the more 
prominent N v line would in some way bias the line pro- 
file fits. On the other hand, a stronger N v line could be 
fit to higher precision, and we have detected no tendency 
for the Nv line shifts to be biased, so this emission com- 
ponent might actually be modeled more accurately using 
high- luminosity quasar spectra. The fact that the Lyman-a 
line is weaker relative to the continuum in luminous ob- 
jects should also improve the accuracy of the fiux extrapola- 
tion, since it is easier to extrapolate the continuum than the 
line profile, in general. Therefore the simultaneous N v plus 
Lyman-a plus continuum fits may perform better at high 
redshift. We defer detailed simulation of parameter recovery 
with high-luminosity spectra to future work. Another con- 
cern is that, as we discussed in i]3.2l indications are that a 
high-redshift sample will have larger negative velocity shifts 
of the Lyman-a line relative to the metal lines. Since a larger 
blueward shift in the line center tends to result in a more se- 
vere underestimation of the neutral fraction, the lower limit 
should still be robust. 

(ii) We assume that the mean IGM density is independent 
of distance from the quasar over our region of interest (ex- 
cept for the sma ll change in mean density with redshift). As 
noted bv lMHOTl . the exclusion of the 6-11 A region immedi- 
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ately on the blue side of the nominal line center eliminates 
the f ew Mpc region expecte d to show s ignificant overden- 
sity (Barkana & Locb 2004). However, Kirkman fc Tvtlen 
Hobs), Guimaracs ot al. (2001), and .Rolhndc ct alj (|2005l ) 
have all inferred large-scale overdensities (on the scale of a 
few to tens of Mpc) in the IGM from studies of the proxim- 
ity effect. The ionized regions we are considering are much 
larger (~ 40 Mpc). However, if the bias did extend into 
our analysis region, then it would increase the mean den- 
sity close to the quasar. This differs from an error in the 
flux extrapolation because it shifts only the resonant con- 
tribution to the optical depth. Without detailed study, it 
is difficult to determine what affect this would have on the 
neutral fraction measurement. In future work, the density 
profile can be estimated from numerical cosmological simu- 
lations large enough to contain the rare massive host halos of 
bright quasars. Even if variations in the overdensity around 
such halos add to the uncertainty in the xigm constraint, 
this should not seriously impact the ability to distinguish 
highly-ionized and largely-neutral scenarios. 

(iii) We assume that the ionizing background is uniform. 
In reality the ionizing background arises from a complicated 
distribution of discrete sources, i.e. galaxies. Inside the large 
ionized region surrounding a luminous quasar, the back- 
ground will still be fairly smooth on large scales, since the 
mean free path of ionizing photons will be large. An overden- 
sity of galaxies would be expected close to the quasar, which 
would modify the Tr curve somewhat (tending to cancel out 
the effect of the density enhancement described above). If 
the density of galaxies is enhanced only very close to the 
quasar (on the order of a few Mpc), then their light will 
simply add to the quasar flux at larger radii. If, on the other 
hand, there is a large scale enhancement in the galaxy dis- 
tribution extending into our analysis region ( ~ 10 Mpc), 
then the fiux will not fall off like r~^, and this would need to 
be taken into account in future analyses of observed quasar 
spectra. 

(iv) Outside of the quasar H ll region, reionization of the 
IGM is expected to occur largely through the growth of 
discrete ionized bubbles separated by neutral gas. Our ex- 
pression for the damping wing assumes a uniformly ion- 
ized IGM with mean neutral fraction a;iGM. If, instead of 
being spread uniformly along the line of sight, the neu- 
tral gas is distributed in a "picket fence" of neutral re- 
gions s eparated by ionized reg i ons, t he damping wing will 
change. [Mesinger fc Furlanettol l|2008l ) studied this effect us- 
ing semi-numerical cosmological structure simulations, con- 
cluding that, if ignored, this effect biases the recovered neu- 
tral fraction by up to Xohs — xigm = 0.3 (where a;obs is 
the value inferred from observations of the damping wing), 
and induces a scatt er of a similar magnitude (see also, 
iMcQ uinn et ai]|2008l ). This is in the opposite direction to 
the b ias we find from flux e x trapo lation errors. However, 
when iMesinger fc Furlanettol l|2008l ) looked specifically at 
the bias induced by inhomogeneities around the largest halos 
(the likely hosts of bright quasars) dwelling in large ionized 
regions, they found that the bias toward overestimation of 
the neutral fraction was reduced, and even reversed for large 
neutral fractions, and that the inhomogeneity-induced mod- 
ification of the damping wing profile may favor underestima- 
tion of the neutral fraction. Taking these effects together, a 
measured value of a;obs = 1 is still unlikely if a;iGM ^ 0.02, 



but more detailed si mulation combine d with larger quasar 
samples are needed. iLidz et al.1 (|2007l ) point out another, 
related complication. The fluctuations in the ionizing back- 
ground produce a large scatter and bias in the relationship 
between the apparent red edge of the GP trough (where the 
flux drops below some threshold) and the actual location 
of the ionization front. However this should not affect our 
method, since we do not assume an -Rhii value in order to 
infer the IGM neutral fraction. 

(v) We ignore radiative transfer effects, which would be 
especially important at the edge of the ionized region where 
the relatively unattenuated flux from the quasar flrst en- 
counters a significant amount of neutral hydrogen. If the 
quasar spectrum is hard, the transition region between the 
highly-ionized interior and the more neutral exterior of the 
region could be broad enough to modify the observed opti- 
cal depth profile at the edge of the transmission w indow 
(|Kramer fc HaimanI l200i : iThomas fc Zaroubil |2008| ). It is 
unclear what effect this would have on the neutral fraction 
measurement. 

While the caveats listed above will certainly add to the 
scatter in recovered values, it still seems unlikely that a 
highly-ionized IGM will be mistaken for a highly-neutral 
one. 



7 CONCLUSIONS AND FUTURE 
DIRECTIONS 

The main conclusion of this paper is that errors in extrap- 
olating the intrinsic emission line shapes generally cause a 
bias towards underestimating the a;iGM value; these uncer- 
tainties therefore strengthen the conclusions Iu IMHOtI about 
a significantly neutral IGM at z > 6.2. From the marginal- 
ized distributions of recovered IGM parameters shown in 
the last section, we can see that it is highly unlikely that 
flux extrapolation errors could cause a highly-ionized IGM 
to be mistaken for a neutral IGM. If a;iGM = 0.001, there 
is less than a 4% chance of inferring a;iGM 0.3. Even 
with a;iGM = 0.1 there is only a 15% chance of measur- 
ing a;iGM 0.2 and a 5% chance of measuring xigm ^ 0.4. 
Neglecting other sources of error, if the neutral fraction were 
0.1 there would be less than a 0.04% chance of simulta- 
neously inferring the three best flt values found bv lMHOTl . 

Besides addressing the caveats mentioned in the last 
section, there are two basic directions in which we plan to 
develop this technique in the future. The first is to improve 
our ability to model and fit the damping wing. This includes 
improving our modeling of the underlying intrinsic quasar 
spectra, our models of the optical depth profiles, and the re- 
covery of information from the observed optical depth pro- 
files. 

In order to improve the extrapolation of the intrinsic 
flux, we will need to exploit correlations between the Lyman- 
a emission components and other spectral features (emis- 
sion lines or continuum luminosity), either through con- 
straints applied to traditional multi-component models, or 
throug h the use of princi pal component analysis. For in- 
stance, IShang et al.l l|2007l ) cite a correlation between the 
Lyman-a and C iv line shifts. We find a similar correla- 
tion in our sample, which we show in Figure 1191 The fig- 
ure demonstrates that the shifts of the lines (determined 
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from fits of a quadratic function to points near the peak) 
are well correlated (the correlation coefficient is 0.68, but 
would be substantially larger with the exclusion of a few 
outliers). . Tytle r fc Farj (|l992h . on the other hand, failed to 
find significant correlations between the shifts of this (and 
any other) pair of lines when they carefully determined sys- 
tematic redshifts from multiple emission lines. This suggests 
that the correlation we see may be due, at least in part, to 
errors in the determination of the systematic redshift. For 
our purposes, however, it does not matter why the correla- 
tion appears, only that it could be exploited to reduce the 
uncertainty on the location of the intrinsic Lyman-a emis- 
sion line. In addition, one could possibly use more detailed 
multi-component fits to the shape of the C iv line as a tem- 
plate for both the Lyman-a and Nv lines. We expect to 
be able to reduce the flux errors to the point that they are 
negligible in comparison to other sources of uncertainty. 

In addition to improving the spectral fits, we may be 
able increase the amount of information extracted from the 
optical depth profiles. Binning the optical depth values, as 
we have done in performing our K-S tests, discards most of 
the wavelength information. Conceivably, there is additional 
information in the detailed wavelength-dependence of the 
optical depth profiles, extractable with a different statistical 
comparison to the model profiles. 

Another source of additional information is the Lyman- 
|3 region of the absorption spectrum. While its interpretation 
is complicated by the overlying Lyman-a absorption, careful 
analysis and comparison with simulat ions should allow i t 
to su pplement the Lyman-a fits (e.g. Mesinger fc Haimanj 
|2004| ). 

Such improvements should be explored in order to 
take full advantage of future samples of high-z quasars. As 
surveys push deeper, more z > 6 quasars are constantly 
being discovere d. For example, the Canada-France High-z 
Quasar Survey l|Willott et al.ll2009l ) found 10 new z > 5.9 
quasars, inc l udini 



ing the transverse and line-of-sight proximity effects of 
quasars at z ~ 2, found intriguing evidence of large- 
scale overdensities near the quasars, as well as evidence 
for quasar lif etimes shorter than 10^ years. Other pre- 
vious studies (iRoUinde et al. 20051: Guimaraes et al. I l2007l : 



l|jiang et aL 200. 



2 at z > 6.2, the SDSS Deep Stripe 
yielded 5 n ew z > 5.85 quasars, and 

6.13. 



UKIDSS (|Mortlock et ai]l2008l ) found one at z = 6.13. The 
quasar luminosity function is extremely steep at the bright 
end that is currently being sampled at high redshift, mean- 
ing that modest increases in the sensitivity of future sur- 
veys should yield large increases in the number of discov- 
ered quasars. Even if the available spectra of such objects 
have significantly poorer signal-to-noise ratios than current 
state-of-the-art z > 6 quasar spectra, they should still be 
suitable for this purpose, since density fiuctuations domi- 
nate the scatter in the current results. 

The techniques outlined here could also be applied to 
a wider range of problems. The models (minus the IGM 
damping wing) and statistical techniques described here 
for absorption spectra of z; > 6 quasars can just as well 
be applied at z < 6 to probe the density and ionizing 
background near quasars. As LSST and other future sur- 
veys perform deep searches for high-z objects, many lower- 
redshift quasars will be discovered, which will be valuable 
as probes of typical quasar environments after the end of 
reionization. At these lower redshifts, rather than creat- 
ing a well- defined H ll region, quasars exhi bit a "proxim- 
ity effect" (|Baitlik. Duncan fc Ostrikeijll988l ) on the nearby 
Lyman-a forest, red ucing the amount o f abs orption near 
the source redshift. iKirkman fc Tvtleij l|2008h . by study- 



iDaU'Aglio. Wisotzki fc WorseckT Hoi) have also expl^d 
the proximity effect at z < 4.5. All of these groups estimated 
the underlying continuum by iterative fitting to transmis- 
sion windows between Lyman-a forest lines. This technique 
is limited to z; < 4.5, since beyond this redshift the Lyman-a 
forest is too thick to reliably infer the continuum. Above this 
redshift, the continuum must be extrapolated from unab- 
sorbed regions of the spectrum. With the improved spectral 
modeling techniques we hope to develop, a uniform analysis 
of the proximity effect could be performed from low redshift 
through z > 6, without having to change the techniques 
used to model the underlying spectrum. 
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APPENDIX A: ALIGNING AND COADDING 
SPECTRA 

All GHRS and some FOS (rapid-readout mode) datasets 
are composed of sets of separate exposures. These separate 
spectra must be aligned before being coadded to preserve 
spectral resolution. Also, some objects were observed mul- 
tiple times with the same instrument configuration, and we 
wanted to coadd these spectra to maximize the signal-to- 
noise ratio. We chose to only coadd spectra with the same in- 
strument configuration (instrument, grating, and aperture) 
so that we would not be combining spectra with different 
resolutions. Therefore we ended up with multiple spectra 
for some objects. 

Our code automatically tries three different methods for 
aligning the spectra and chooses the one that minimizes the 
deviations between the input spectra and the mean (as de- 
fined below). The methods are to align the spectra with no 
offsets, based simply on the supplied wavelength coordinates 
for each spectrum, to align the spectra by maximizing the 
correlation function, and to first smooth the spectra, then 
align them by maximizing the correlation between smoothed 
spectra. The method that minimizes deviations between the 
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aligned spectra and the mean spectrum (as measured by 
srms, defined below), is chosen for the final output. We ex- 
clude 10% of the pixels with the lowest signal-to-noise ratio 
in the mean spectrum, then sum over all of the remaining 
pixels in each input spectrum in order to characterize the 
quality of the alignment: 



Srms = 




where fi is the flux value for a pixel in an input spectrum, 
{f}i is the flux value in the corresponding pixel in the mean 
spectrum, and E{Si) is the expectation value of the standard 
deviation for that pixel, given by 

E{Sf) = ^J2''^ (A2) 
j 

, where aj is the uncertainty on flux value j, and the sum is 
over all of the input pixels contributing to a single pixel in 
the mean spectrum. 

The general procedure for aligning and coadding spectra 
is as follows: 

Spectra are interpolated onto a common, uniform, wave- 
length grid, then (if desired) smoothed with a Gaussian ker- 
nel {a — 3 pixels). 

Aligning the spectra: One spectrum is chosen as the ref- 
erence spectrum and the correlation function is calculated 
between it and each remaining spectrum. The correlation 
between the reference spectrum F and another spectrum / 
as a function of pixel offset s is 

CfAs) = 5] (i^. - {F)) ih+s -if)), (A3) 

i 

where Fi is the flux in pixel i of the reference spectrum, fi+s 
is the flux in pixel i -I- s in the other spectrum, and (F) and 
(/) are the mean flux values of each spectrum. 

For each spectrum, the offset is calculated by finding 
the peak of a parabola passing through the maximum of 
the correlation function and its two neighboring points. The 
wavelength coordinates of the spectra are then shifted by 
the calculated offset and the spectra are reinterpolated onto 
a common wavelength scale. 

Coadding spectra: Spectra are coadded (flux values for 
each pixel are averaged), excluding outliers and choosing a 
weighting scheme that maximizes the signal-to-noise. If the 
spectra have an average signal-to-noise ratio of less than 2, 
we use an unweighted mean of the pixel values from each 
spectrum. Otherwise the weights are Wi = Fi/a^, where Fi 
is the fiux in pixel i and ai is the uncertainty. If the uncer- 
tainties are Poisson-dominated, this yields Wi — hi/dvidtiAi 
where di/ is the bandwidth of the pixel, dt is the integration 
time, and A is the effective area. This has the advantage over 
the traditional Wi = of not biasing the results when the 
uncertainty is estimated from the data. We also propagate 
the statistical uncertainties to calculate the uncertainties for 
each pixel in the weighted mean spectrum. 

Excluding low signal-to-noise spectra: If we are using the 
unweighted mean, then spectra are excluded if their signal- 
to-noise ratio is below {[n^/(n — 1)] — (n — 1)}"^/^ times 
the mean signal-to-noise ratio. This threshold defines the 
approximate level below which adding a spectrum actually 
degrades the overall signal-to-noise ratio. 



Normalizing spectra: We compare the normalization of 
each input spectrum to the mean by calculating the mean 
fractional flux offset between it and the weighted mean spec- 
trum. The signiflcance of the flux offset is then calculated, 
and any spectra that are more than 3a below the mean 
are normalized to have the same average flux as the mean. 
We only re-normalize spectra below the mean because we 
assume that flux offsets are caused by non-optimal position- 
ing of the source in the aperture, which can only result in 
missing flux. This procedure is iterated until all mean flux 
differences are less than 3a. Any bias introduced by this pro- 
cedure should be unimportant, since we only care about the 
overall shape of the spectrum, not its normalization. 

Outlier exclusion proceeds in two phases. First we iter- 
atively exclude individual pixels indentified as outliers, then 
we exclude all data at wavelength coordinates where the 
spectrum-to-spectrum dispersion is too high. 

Excluding outlying pixel values: We calculate the dif- 
ference between each pixel value in the input spectra and 
the value of that pixel in the mean spectrum, and calcu- 
lated the uncertainty of that difference. We also calculate 
the standard deviation of the values for each pixel. Any 
value deviating more than 5 times the propagated uncer- 
tainty from the mean and more than 1 standard deviation 
from the mean is excluded. 

Flagging had pixels: The expected level of deviation S 
among the n data points for each pixel is given by E{S^) = 
{l/n)'^af, where ai are the uncertainties on each value0 
any pixel in which the standard deviation of the input values 
is higher than the 3S is flagged as bad and excluded from 
future analysis. 

Finally an output spectrum is constructed by averaging 
the input spectra, with wavelength offsets and weights if 
appropriate, and excluding any detected outliers. 



APPENDIX B: AUTOMATIC ABSORPTION 
FEATURE DETECTION 

In order to characterize the intrinsic quasar spectra, we need 
to exclude absorption features from our fits, and we need to 
do so automatically to take advantage of large samples of 
quasars and have reproducible results. At the same time, we 
want to avoid biasing our fits by removing low-flux pixels 
that represent real variations in the intrinsic quasar spec- 
trum. We do this using two rounds of iterative feature de- 
tection. In the first round, we apply our absorption-feature 
threshold only to pixels deviating below the model spec- 
trum. In the second round we apply the same threshold to 
pixels deviating above the model spectrum. This way, if our 
threshold is too stringent we will notice spurious fiagging 
of positive deviations. This had the unexpected benefit of 
automatically detecting and excluding pixels influenced by 
blending with neighboring broad emission lines in several 
cases. 

Our feature detection is iterative. First, we fit a model 
to the observed fiux profile. We then smooth the observed 
spectrum, model spectrum, and the error spectra of each, 
and calculate the difference between the smoothed model 

^ http:/ /www. amstat.org/publications/jse/vl3nl/vardeman. html 
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and observed spectra. We then calculate a feature detection 
threshold 

/La.h(A) = [Cf /fit(A)]' + [Ce CTobs(A)]' + a^X) (Bl) 

using statistical uncertainties on the model flux ((Jflt), un- 
certainty on the measured flux (cTobs), and the observed flux 
(/fit). The constants c/ = 0.05, Ce = 6, were chosen by hand 
to be rather conservative. That is, wc err on the side of not 
excluding features, rather than risk falsely excluding "good" 
pixels. Any pixel in which the magnitude of the smoothed 
residuals exceeds the threshold is flagged as an absorption 
or emission feature. Pixel within ±AA of a flagged pixel are 
then excluded around the detected features, and the fit is 
repeated, excluding the flagged features (or only the flagged 
absorption features in the first round). 

AA = 2V«^^i„ + <3„ (B2) 

where = 0.75 A and winst = FWHM/2.355, with 

FWHM being the full width at half maximum of the in- 
strumental line spread function (LSF). The iteration stops 
once no more features are fiagged. 
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Figure 14. Best-fit parameters versus the red-to-blue flux ratio 
of tlie Lyman- DC emission about 1215.67 A. The shading indi- 
cates how many fits fell in a given region of the graph. The red 
points plot the mean (over 200 random IGM density profiles) of 
the recovered parameter values for each input spectrum. Input 
parameters are Rhii = 40.5 Mpc, x^ef = 10~^'^, xjqm = 0.1. 
See Fig. |6] (top panel) for the distribution of the fiux ratio in our 
sample of spectra. 



Figure 13. Parameter recovery map using mock spectra with un- 
known intrinsic spectra. The shading of each square indicates the 
number of best-fit values that fell within that cube in parameter 
space. The input parameters (Run = 40.5 Mpc, x^ef = 10~^'^, 
2^IGM = 0.1) are indicated with a white point. Compare with 
Figure [TT] which has the same input parameters but uses known 
intrinsic quasar spectra: an additional scatter, as well as a bias 
has been induced by errors in the extrapolation of the intrinsic 
spectrum. 
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Figure 15. Marginalized (over -Rhii and x^^f) distribution of 
estimated xjqm values with perfectly known (top) and extrap- 
olated (bottom) intrinsic spectra, for a highly-ionized fiducial 
IGM. Input parameters are -Rjjll = 40.5 Mpc, Xj-ef = 10~^'^, 
^IGM = 0.001. Note that the x-axis scale extends to lower values 
than in Fig. [12] because we used a different parameter-space grid. 
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Figure 16. Marginalized recovered x'igm distribution with the 
wavelength range of the fit restricted to A < 1205 A. Compare 
to Fi gure |12| (bottom panel) where Amax — 1210 A. The input 
parameters are -Rhii = 40.5 Mpc, x^cf = lO"^ "", xiqm = 0.1 (as 
in Fig. I12I I. Lowering the upper limit of the fit range has reduced 
the bias towards low values of xjgm at the expense of some extra 
scatter toward higher values. 
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Figure 17. Fractional flux excess in the red-side-only fits (with 
velocity offsets fixed at their median values) versus wavelength. 
The excess is calculated by comparing the red-side-only best-fit 
model (-Fext) to the full-profile fit (-Ffuii). The top panel shows 
the models fit to the full profiles of all 87 spectra (light grey) and 
the median (dashed) and mean and standard deviation (solid) of 
the set of model spectra. The bottom panel shows the fractional 
difference between the models fit to the red side only and the 
models fit to the full profile for all 87 spectra (light grey), and 
the median (dashed) and mean and standard deviation (solid) 
of the fractional flux difference. The vertical solid line (green) 
indicates the nominal central wavelength of the Lyman- a line. 
The vertical dashed line (blue) at 1220 A indicates the blue edge 
of the red-side-only line profile fit region. The vertical dotted lines 
at 1199 A and 1210 A (red) demarcate the typical analysis region 
for the high-redshift IGM measurements. Compare with Figure 
1171 (which has velocity offsets fixed at zero) . 
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Figure 18. Marginalized recovered xiqm distribution with veloc- 
ity offsets of the components fixed at their median values. Com- 
pare to Figure 1121 (bottom panel) where the offsets are fixed at 
zero. The input parameters are -Rjjll = 40.5 Mpc, x^ef = 10~^'^, 
^IGM = 0.1 (as in Fig. I12| l. Using the median offsets has slightly 
reduced the bias in the recovered neutral fraction distribution. 
Compare to Fig. 1121 bottom panel. 
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Figure 19. Shift of tlic Civ (1549 A) line peak versus shift of 
the Lyman- a peak. Tlie solid line is the unweighted least-squares 
fit to the plotted points. The dotted line indicates equal shifts. 
Note that this plot is preliminary. 
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Table 1: HST (FOS and GHRS) quasar Lyman- oc emission line spectra. 



Name 


z 


Instrument 


Aperture 


Grating 


S/N 


Comments on spectra 


(NED) 


(NED) 








(mean) 


and fits. 






MRK 0335 


0.026 


FOS 


B-3 


H13 


16.2 


Fit with free line centers flags a 














large number of pixels as "emis- 














sion" features. 




FAIRALL 0009 


0.047 


FOS 


B-3 


H13 


10.8 


Excluded 


due 


to non- 














convergence 


of 


feature de- 














tection. 






UGC 00545 


0.061 


FOS 


B-3 


H13 


25.6 


Excluded 


due 


to non- 














convergence 


of 


feature de- 














tection. 






UGC 11763 


0.063 


HRS 


LSA 


G140L 


20.5 


Excluded 


due 


to non- 














convergence 


of 


feature de- 














tection. 






MR 2251-178 


0.064 


FOS 


B-3 


H13 


14.3 


Excluded 


due 


to non- 














convergence 


of 


feature de- 














tection. 






TON 1187 


0.079 


FOS 


B-3 


H13 


11.0 


Feature detection did not con- 














verge for red-side fit with zero 














shift. 






MRK 0478 


0.079 


FOS 


B-3 


H13 


11.6 


Excluded due non 


-convergence 














of feature detection (all flts). 


PG 1211-1-143 


0.081 


FOS 


B-3 


H13 


11.2 


Excluded 


due 


to non- 














convergence 


of 


feature de- 














tection. 






PG 1211-1-143 


0.081 


HRS 


LSA 


G140L 


26.7 


Excluded 


due 


to non- 














convergence 


of 


feature de- 














tection. 






SDSS 


0.117 


HRS 


LSA 


G140L 


25.0 


Excluded due to 


feature de- 


J12305003+0115226 












tection non 


-convergence (and 














many absorption features). 


SBS 1626-f554 


0.133 


FOS 


B-3 


H13 


10.5 


Strong absorption feature at 














line center. 






PG 0026+129 


0.142 


FOS 


B-3 


H13 


15.0 


Full flt spuriously excludes a 














large region 


on red side as ab- 














sorption, but fits 


well on the 














blue side of the line. 
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Table 1 - continued from previous page 





z 


Instrument 


Aperture 


Grating 


O/ IM 


Comments on spectra 


1 IN ij 1 J I 


1 IN HiLJ J 








(rncQjn) 


and fits. 


T)f ^ 111/1 A 


U.144 


r KJd 


P Q 

rs-o 


W1 Q 


1 O 7 

Iz. r 


Excluded due to non- 














convergence of feature de- 














tection. 




U.lOo 


r Wo 


9 

w-z 


TT1 Q 


90 Q 

zu.y 




^ i o 




FOCI 


R 9 

D-Z 


TT1 ^ 


1 Q Q 
ly .y 




0\y z / o 


n 1 


r Wo 


R ^ 


Kl ^ 


1 7 






U.lOo 


r yjo 


A 1 


Wl ^ 
mo 


17 
1 / .u 




QP 97Q 


1 "iS 


17 vyo 


B-1 


H13 


IS 1 

±0.1. 




PP 1 ^99 J-ficiQ 


1 

U.lDo 


r Wo 


R ^ 


TT1 ^ 


ISO 
lo.U 


ruli lit excludes a large section 














of blue side contaminated with 














loivi lines. 


PP 111 fiJ-91 


U.l / 


FOQ 

r Wo 


P 9 

w-z 


PT1 ^ 

nio 


1 Pi Q 

10. y 




rPTRSQl 1497-1-480 


0.221 


FD^ 


R 


H13 


12.8 


iT. UbUl Ij lUli IccLUUlt; ill UlUc Wllig, 














r\T 't'rio lino iTiQOTTO/n in Tnll ttI"1 
KJL tllc lilit; Ilid^^tJLl ill iU.il liUJ. 


pn nQ^^-i-414 


9^4 


FOCI 

JT WO 


P 9 
w-z 


TT1 ^ 


111 


-rt. uoUi p tiuil leaiuie at liiie ceii- 














ter. 


ixiJDoy 1 ± j-OUn^zio 


^4Q 


FOCI 
r wo 


R ^ 


TT1 Q 

X3. j.y 


10 7 

lU. i 




pn in4Q-nn'^ 


^fiO 


wo 


0-9 


H19 




iZjAL-i ULlcU. LlUt; tU UcWJ. U1U.C cUgC 














KJL sptJl^ti Ulii. 


[HRSQ] 149^-I-9(S7 

IXXJ-->Oi7j ±^ZnJT^Zi\J 1 


0.366 


FOS 


B-3 


H19 


12.5 


O lii Llilg dUOLJi JJ tHJil ililC J LiO u 














blueward of line center. 


XlO UDZ4-|-DyU ( 


U.o ( u 


FOQ 
r Wo 


P 9 


W1 Q 
riiy 


17 9 
1 / .Z 






^79 


r Wo 


P 9 
W-Z 


tri Q 

my 


1 ^ Q 

lo.y 


ii/XcludeQ due to bad blue edge 














of spectrum. 




400 

U.4:UU 


FO^ 

-T WO 


R 


HI Q 


97 4 


riigiiiy aby liiiiieiiiic piuiiie. xxeci- 














side fit with, median shifts failed 














to converge. 


^P 91 


419 
U.41Z 


FOCI 
r Wo 


A 1 


W1 Q 
my 


117 
11. ( 






414 


FOCI 

JT WO 


A 1 

/A.- J. 


HI Q 
xi j.y 


10^ 

lU.O 




rTTRRQl lO/IQ-l-filf^ 


/191 


FOQ 
r Wo 


P 1 

W-1 


T 1 


10.4 




rPTR^^Q] 9'^0^^-l-OQ^^ 


4^^ 


FO^ 
r wo 


R 


HI Q 
n±y 


17 4 




PG 0003+158 


0.451 


FOS 


C-2 


H19 


19.9 




FBQS 


0.461 


FOS 


C-2 


H19 


13.3 




J0745416+314256 














[HB89] 2112+059 


0.466 


FOS 


C-2 


H19 


13.0 




NEDOl 














SBS 1259+593 


0.478 


FOS 


C-2 


H19 


16.1 


Poorly constrained very narrow 














component in all red-side fits. 
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Table 1 - continued from previous page 





z 


Tnd'T'n TYncirt" 


x^jjjci t Lilt; 


I -It* q ■J" in fT 




V^UllllliCllLo fJll OjjJcCLlcl 


(NED) 


(NED) 








(mean) 


and fits. 




U.UU-L 




P-9 


H19 


1 Q Q 


T-Tiill T^T'i~iTild Ti^" Ti CI rrc T"£iri CI ri ^ /^f 
Uil piOiiit; lit ild^o ICU Ui 

the Ly-alpha line core. 


[HB89] 1130+111 


0.510 


EOS 


C-2 


H19 


10.6 


Excluded due to bad blue edge. 


rwRao! nsp;n_L/i/in 
[xiijoyj Uoou-f-^^u 


U.014 


r kJd 


P 9 

w-z 


W1 Q 


lU.o 




FBQS 


0.530 


EOS 


C-2 


H19 


10.9 


Missing some flux values at 


Tf)Q'iS9nQ-l-S994n9 












lino ppTilpT" 1 fl p crcTfsri H nfin cr 
L/UdLlQlll^ J . 


[HB89] 0454-220 


0.533 


EOS 


B-3 


H19 


25.1 


Excluded due to non- 
tection. 


lKJi>i UiOD 


u.o4y 


r vJo 


rS-o 


XT1 Q 

rliy 


oi.D 


Excluded due to problems with 
coadded spectrum. 




u.ooo 


-T WO 


u o 


H19 


12.5 




1 Kjj lOOO-\-H{j 


U.OOo 


r v^o 


P 9 
w-z 


TT1 Q 
xiiy 


19^ 
IZ.O 


li;Xclud,eQ due to bcid blue edge 
of spectrum. 




U.OOO 




P 9 

w-z 


W1 Q 
my 


10. U 




IN VjV_>' Zo^-L UJjO 


U.OOO 


FOQ 


P 9 

w-z 


TT1 Q 


14.0 




ixiooy 1 ± ±ou-io<j 


U.OOo 


FOCI 
r wo 


P 9 
w-z 


TT1 Q 

±±±cf 


1 l.O 


XXcLl-olLlt; liL bp III lU Lloiy llcLggtJQ 

some of narrow component. 


[xiDoyj U'iuo-izo 


U.O i o 


r Wo 


P 9 
w-z 


xiiy 


1 S 9 
lo.z 






u.oyo 


r wo 


P 9 
w-z 




1 A n 


iXtlgjlUIl (Jl lt;Ll blLlt; lldjg^cCl do 

emission in full-profile fit. 




U.DIZ 


FOQ 
r Wo 


P 9 
W-Z 


HI Q 


1 Q Q 

ly.y 


















oL/ uyo 


U.DiD 


r Wo 


P 9 

w-z 


riiy 


17 9 
1 / .Z 




PG 0044+030 


0.623 


EOS 


C-2 


H19 


10.8 




[HB89] 1104+167 


0.632 


EOS 


C-2 


H19 


17.9 




[XllJoyj ZZ4o-izo 


U.DOZ 


r Wo 


P 9 


rliy 


17 9 
1 / .Z 




o^^ zuo 


fidfi 
U.U^U 


FOCI 
r wo 


A 1 


HI Q 


10 9 
±u.z 




3C 263 


0.646 


EOS 


C-2 


H19 


14.2 




OKy UO ( 


u.Doy 


r v_/o 




rliy 


9^ 9 




[HB89] 2344+092 


0.677 


EOS 


C-2 


H19 


12.5 


Fiill fit flap's pHppr of t.Vip naT*- 
row component as emission fear 
tures. 


[HB89] 0923+392 


0.695 


EOS 


A-1 


H19 


21.1 




[HB89] 2352-342 


0.702 


EOS 


C-2 


H19 


15.8 


Red edge of narrow component 
is flagged in red-side flts. 


[HB89] 1354+195 


0.720 


EOS 


C-2 


H19 


11.4 
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Table 1 - continued from previous page 





z 


Instrument 


Aperture 


Grating 


0/ iM 


Comments on spectra 


1 IN I J 1 y I 


I IN 1- 1 y J 














n 790 

u. ( zy 


FOG 


R '\ 


■R1 
my 


10.0 


Excluded due to poor signal-to- 
noise, weak emission lines. 


[xlijoyj lDO/-rO/4l: 


7'^1 
U. ( Oi 


r \Jo 


A 1 


■R1 

my 


1 n 




rWR^^Q] 1 ^'^R-i-A77 
ixiijoyi i-OOO^^ i i 


n 779 


-T WO 


C 9 
w-z 


W1 

n j.y 


91 7 

Zl . ( 


oe vei di aubui p tiuii ledi ui eo 011 
icu. alio, uiue oiqc. 


91\/T A qq; 


n 77^ 


FOG 
r Wo 


P 9 
w-z 


TT1 Q 

my 


19^ 
IZ.O 


Poorly constrained narrow 


J1003067+681316 












component in red-side fits. 


or^ 1 1 n 
iiu 


n 77c; 


FHQ 
r vjo 


R Q 


TT1 


19 1 




r JjV^o 


n 7sn 

U. / oU 


r vjo 


R Q 
JD-O 


wi 
xl±y 


9S Q 

zcs.y 




J1253175+310550 














T "Rr^G nino 071 q 
L/JdI^o UiUz-z/io 


n 7QO 


r vJo 


R Q 

rS-o 


IT1 Q 


97 A 




okj Zoo 


u.o4y 


r Wo 


A 1 


T 1 


1 Q 9 






u.ooy 


FOG 
r Wo 


P 9 

w-z 


nz ( 


19 


JN arrow component poorly con- 
strained in all red-side fits. 


3C 4543 


0.859 


FOS 


A-l 


H27 


13.4 






U.oOl 


FOG 
r Wo 


-D-O 


H97 


1 7 
10. ( 


Narrow component poorly con- 

DLiaiilcU. Ill ieU,-olU,c llLo. 


TTTRSQl 19^^90-110 


U.o ( 


FOG 
r Wo 




■H'97 


1 9 
IZ.O 




LBQS 0253-0138 


0.879 


FOS 


A-l 


H27 


10.2 




ixixjoyj zo'±u~uou 


u.oyz 


FO^ 

I: WO 


w-z 


H27 


1 fi n 


T ./^'J'C! i~\T r>Q/n T»TV£lla FllU Tl^" rtQC 

J-jtJto Ui UaAJL piA-tJlb. J? U.11 lit lido 

unconstrained emission compo- 
nent on red side of line. 




u.oyj 


FO^ 

x" wo 


A-l 


H27 


1 Q 




rWRSQl 991 HQS 


n 00 1 


FOQ 
r Wo 


A 1 


W07 
xlz ( 


1/17 
14. ( 




PT<"G 0S9Q 99Q 


n 01 


r Wo 


R Q 


W07 
riz / 


1 Pi 
10.0 


Excluded due to lack 01 visible 
emission lines in spectrum. 


3C 336 


0.927 


FOS 


B-3 


H27 


13.9 


ISM absorption line in center of 

l_j V dlUlld llllt;. \-/ lilltJl Ul (-J1 111 11 til 11) 

absorption features. 










nz / 


10. u 


-CjXCiUClctl LlUt: LO WcdiK cllllDOlOll 


J1409239+261821 












lines. 


SBS 1340+606 


0.964 


FOS 


A-l 


H27 


12.7 


Excluded due to strong absorp- 
tion features (blue- and red- 
side). 


3C 094 


0.965 


FOS 


A-l 


H27 


10.9 


Excluded due to strong absorp- 
tion linos (red and blue side). 


TON 0157 


0.971 


FOS 


B-3 


H27 


15.6 


Negative narrow component in 
red-side fit. 
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Table 1 - continued from previous page 





z 


Instrument 


Aperture 


Grating 


O/ iM 


Comments on spectra 


1 IN I J 1 y I 


I IN I J I y 1 














Q7f^ 


r v_yo 


A 1 


TT97 

XlZ < 


1 Pi 9 

iO.Z 


Very broad, blended emission 














lines. 


[rlJjoyj Zl40-t-UD( 


000 


PHQ 
r kJd 




tT97 
xlZ / 


1 Q 


Highly asymmetrical profile. 


ixTiJoy 1 \Jooo-^oo 




prm 

r wo 


A 1 


M97 

XlZ / 


19 


iNaiiuw cuiiijpoiieiiL jpuoiiy cuii- 














oiiaiiieu. Ill leu-oiQc iiio. 


TOM 01 f^'^ 


1 099 


r v^o 




R97 
nz / 


ID.O 






1 09'^ 


prm 

p wo 


C 9 


R97 

XXZ J 




iMaiiuw cuiiijpoiieiiL jpuoiiy con- 














strained in red-side fit. R,ed-side 














fit with, median shifts failed to 














cuiivei^e. V eiy ui oao uieiioeci 














emission lines. 




1 0*^*^ 
1 .Uoo 


r wo 


R 9 


■R97 
nz 1 


1 R 1 

ID. i 




rWR^^Ql 99*^0-1-114 


1 0^7 


wo 


A-1 


H27 


13.2 




LBQS 1229-0207 


1.043 


FOS 


C-1 


H27 


17.8 




[HB89] 2302+029 


1.044 


FOS 


B-3 


H27 


25.0 


Unusual line shape and lots of 














absorption features. 


1 ^1 7-1- '^90 




wo 


A-1 


H27 


12.5 






1 084 


r Wo 


P 9 


TT97 

XlZ 1 


97 Pi 
z i .0 


ocveral absorption features and 














many uaci pixeio. 


TTTRSQl nn9A-l-99A 


1 1 1 Q 


r Wo 


P 9 


TT97 

XlZ f 


1 1 Q 

11. y 


Narrow absorption in line cen- 
ter. 


PP 1 ^Pi9J-01 1 


1 1 97 


r Wo 


P 9 
^-z 


R97 

XlZ I 


14 


Several moderate-width ab- 














oOl p t luii leai ui CO . r uui ly 














constrained very narrow com- 














puiieiit 111 ulue-oicie iito. 


Pn 190fi-l-4'^Q 




FOCI 
r wo 


P 9 


XlZ / 


1 p; p; 


ooliie iiai 1 ow aooui p Liuii leci- 














tures. 


PP 1 QQft_L/l 1 


191/1 


r Wo 


P 9 


R97 
XlZ ( 


1 9 Q 
IZ.O 


Excluded due to prominent 














IGM absorption features. 


Pn 0Q4fi-l-^01 


1.221 


FO^ 

IP wo 


1 ) • ) 


H27 


90 7 

ZU. i 


Fv/^liiHori H 1 1 o c'^" vi^iTi rr rw/^Q/H 
JjjAL/I LlUt^U. U Lie Lr(J o Li UIIk , UI vJcHJ. 














absorption features. 


[HB89] 1038+064 


1.270 


FOS 


B-2 


H27 


19.6 


ggygj-gj^j narrow absorption lines. 


PG 1241+176 


1.273 


FOS 


C-2 


H27 


20.7 


Some narrow blue-side absorp- 














tion features. 


PG 1008+133 


1.289 


FOS 


C-2 


H27 


17.4 


Missing red-side fit with median 














shifts due to non- convergence of 














feature detection. Many narrow 














absorption features. 



Continued on next page 







Tahip 1 - 


continued from previous page 






z 


Tnd'T'n TYicirt" 


Aperture 


Grating 


S/N 


WUlllllidlLo fJll ojJcCljlcl 


(NED) 


(NED) 










&nci fits. 


[XlDoyj iD04-i-iUD 


1 QQ/1 
i.004 


r Uo 


C-2 


H27 


45.6 


Excluded due to non- 




























tection. 


IXTlJoy 1 yJ^O'^^UOc/ 


1 .04:0 


r wo 


C-1 


H27 


17.0 


H VJ^Ill/H^liH 1 1 £i "^"/^ Tl/^T^ 

Il/XCi UQcU. QUt; tu llUIi- 














CfJliVclgclHw/C (Jl itJclljlilt; Ut?" 
















[xiijoyj U'io^t-rUoy 


1.040 


r Wo 


B-3 


H27 


14.3 


Lots of n&rrow ctbsorption fea- 














tures. 




l.O / o 


FOCI 


C-2 


H27 


12.8 


OLlUll^ licLllUW dUbUl JJLiUll iL-d- 














tu.icju.oij icu-Wdiiu. Kji line (^ciiijCi . 


IXlJDoyj \JO\J^~^^0 




FOCI 
r wo 


B-1 


H27 


23.5 


Narrow component poorly con- 














sLldjiliCLl Hi ICLl-alLlC ills. 


[HR^^Ql nQ^7-(-^fi1 
ixxxjoyj uytj i ^tjui- 


1.414 


EOS 


B-1 


H27 


44.6 


FjYpIi 1 H pH Hup i~r\ ^'i'Tr\T\ c fi i\— 
uj ACi U-vJ-CU. \j. uc iiyj Dili kjii^ cxu 














sorption features (red and blue 














side) . 


QP 9QC 

Ovy zyo 


1 4^7 


FO^ 
r wo 


ID-O 


■R97 


91 


rijXCi UClcCl tlUfc; lU obi (Jll^, Ul OcLQ 














absorption lines (red and blue 














side) . 


[HRSQl 09*^9-049 


1.440 


EOS 


B-2 


H27 


24.0 






1.4:OU 


FOCI 
J? wo 


A-1 


H27 


24.3 


JZjXCi UUcU. Q Ut! t U lllloolllg 11 U-X. 


T1 91 ^^zLnzL7-i-p;ni ^4^4 












VcLlLltJb. 


TTA/T 49^ 


1 4f^9 


Frm 

r wo 


B-2 


H27 


19.7 


rijXCl UQcQ QUt; LU piUlllllitilll 














UlUcHJ. diUbUl JJtliJll llllco lltJU OjIIU, 














UlUt; olQc I . 


[xiijoyj iDou-t-o( ( 


1.4 / D 


r Wo 


R 1 




98 




pr^ m 1 7_i_9i Q 


i.4yo 


r Wo 


B-2 


H27 


19.6 


JVlany narrow absorption tea- 














LLllCb. JTvCLl-hlLlU lib lldjS cl JJUUliy~ 














UclldVtJU lldllUW CUllipUlltJlll. 


FTTRJ^Ql 074^ fi7^ 
ixiooy 1 u 1 ^o-u / o 


1 "^10 
l.O-LU 


FOCI 
r wo 


C-2 


H27 


22.0 


J_jdJ.gc ocCtlUll Ul 1111c opullOUoiy 














lldigigcLl Uy ICdLUlC LlcLrCCLlUIl ill 














Tiill fii" Rliip qiHp ot fii" OTiorl 

lUll lit. UlUC OliJ-C L*! lit OOlll gULiLl. 


T ROQ 1 n9f; nn/i c^R 


i.ooi 


FHQ 
r Wo 


B-3 


H27 


22.6 


Excluded due to misaligned 














sub- spectra. 


[HB89] 1115+080 


1.735 


FOS 


B-2 


H40 


22.5 


Excluded due to absorption fea- 














tures (in both Ly-alpha and 














NV). 


CTS 0286 


2.545 


FOS 


B-1 


L65 


23.1 


Excluded due to high redshift 














and low resolution. 
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Table 1 - continued from previous page 



Name z Instrument Aperture Grating S/N Comments on spectra 

(NED) (NED) (mean) and fits. 

[HB89] 1413+117 2.558 EOS B^l H40 205 Excluded due to broad absorp- 

tion features. 

MAPS-NGP 3.620 EOS B-2 H57 24.2 Excluded due to high blue-side 

0.382.0380226 IGM absorption. 



Table 3: Best-fit spectral parameters from fits to the full line profile. 
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Table 4: Best-fit spectral parameters from fits to the red side only of the 
Lyman- a line. 
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